Celestine Floquet's picture

7 11

Celestine Floquet

Celestine-floquet

·

AI & ML interests

cute voice models, anime waifus

Recent Activity

upvoted a paper 27 days ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

upvoted a paper 27 days ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

upvoted a paper 27 days ago

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet