deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 808k • • 1.44k
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts Paper • 2505.10010 • Published May 15, 2025 • 2