-
starriver030515/hapo_data
Viewer • Updated • 1.59k • 101 -
starriver030515/Qwen2.5-Math-1.5B-16k
Text Generation • 2B • Updated • 4 -
starriver030515/Qwen2.5-Math-7B-32k
Text Generation • 8B • Updated • 7.12k -
From Uniform to Heterogeneous: Tailoring Policy Optimization to Every Token's Nature
Paper • 2509.16591 • Published • 2
Zheng Liu
starriver030515
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
21 days ago
Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model
Reasoning
authored
a paper
30 days ago
MinerU2.5: A Decoupled Vision-Language Model for Efficient
High-Resolution Document Parsing