li

mimasss

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 days ago

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Paper • 2601.10355 • Published 5 days ago • 33

upvoted 2 papers 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 39

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

upvoted a paper 4 months ago

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Paper • 2509.26226 • Published Sep 30, 2025 • 33

published 4 datasets 8 months ago

mimasss/llama3.2-3b-instruct-helpsteer2

Viewer • Updated Apr 16, 2025 • 9.09k • 46

mimasss/llama3-8b-instruct-uf

Viewer • Updated Apr 16, 2025 • 60.4k • 59

mimasss/llama3-8b-instruct-helpsteer2

Viewer • Updated Apr 16, 2025 • 9.58k • 174

mimasss/llama3.2-3b-instruct-uf

Viewer • Updated Apr 16, 2025 • 60.7k • 94

updated 4 datasets 9 months ago

mimasss/llama3.2-3b-instruct-helpsteer2

Viewer • Updated Apr 16, 2025 • 9.09k • 46

mimasss/llama3.2-3b-instruct-uf

Viewer • Updated Apr 16, 2025 • 60.7k • 94

mimasss/llama3-8b-instruct-helpsteer2

Viewer • Updated Apr 16, 2025 • 9.58k • 174

mimasss/llama3-8b-instruct-uf

Viewer • Updated Apr 16, 2025 • 60.4k • 59

upvoted a paper about 1 year ago

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 73

liked a Space over 1 year ago

Model Memory Utility

Calculate vRAM needed for model training and inference