Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published 29 days ago • 75
ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems Paper • 2510.26475 • Published Oct 30, 2025
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45
Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation Paper • 2506.04614 • Published Jun 5, 2025 • 19
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Paper • 2505.21471 • Published May 27, 2025 • 5
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published Apr 3, 2025 • 57
Enhancing Language Multi-Agent Learning with Multi-Agent Credit Re-Assignment for Interactive Environment Generalization Paper • 2502.14496 • Published Feb 20, 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 432