Celestine Floquet

Celestine-floquet

AI & ML interests

cute voice models, anime waifus

Recent Activity

upvoted 3 papers about 1 month ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26 • 132

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26 • 117

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28 • 67

upvoted a paper 2 months ago

FLARE: Fast Low-rank Attention Routing Engine

Paper • 2508.12594 • Published Aug 18 • 7

upvoted 2 papers 4 months ago

Lizard: An Efficient Linearization Framework for Large Language Models

Paper • 2507.09025 • Published Jul 11 • 18

Fast and Simplex: 2-Simplicial Attention in Triton

Paper • 2507.02754 • Published Jul 3 • 26

upvoted a collection 7 months ago

blt

4 items • Updated Apr 17 • 27