YI REN

Joshua-Ren · https://joshua-ren.github.io/

AI & ML interests

LLM, Cognitive science

Organizations

None yet

Recent Activity

upvoted a collection 1 day ago

Gemma 3 Release

Collection
28 items • Updated Aug 11 • 572
upvoted a paper 19 days ago

On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral

Paper • 2512.04220 • Published 26 days ago • 11
upvoted 2 papers 2 months ago

Token Hidden Reward: Steering Exploration-Exploitation in Group Relative Deep Reinforcement Learning

Paper • 2510.03669 • Published Oct 4 • 1

SimKO: Simple Pass@K Policy Optimization

Paper • 2510.14807 • Published Oct 16 • 10
upvoted a paper 8 months ago

Learning Dynamics of LLM Finetuning

Paper • 2407.10490 • Published Jul 15, 2024 • 2