Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective Paper • 2509.22613 • Published Sep 26 • 9
DocReward: A Document Reward Model for Structuring and Stylizing Paper • 2510.11391 • Published 20 days ago • 26
Information-Preserving Reformulation of Reasoning Traces for Antidistillation Paper • 2510.11545 • Published 20 days ago • 1
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published 5 days ago • 20
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published 3 days ago • 21
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning Paper • 2506.08889 • Published Jun 10 • 23
Model as a Game: On Numerical and Spatial Consistency for Generative Games Paper • 2503.21172 • Published Mar 27
Think Only When You Need with Large Hybrid-Reasoning Models Paper • 2505.14631 • Published May 20 • 20
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Paper • 2501.07542 • Published Jan 13 • 3
Zero-shot Cross-lingual Transfer of Neural Machine Translation with Multilingual Pretrained Encoders Paper • 2104.08757 • Published Apr 18, 2021
WildLong: Synthesizing Realistic Long-Context Instruction Data at Scale Paper • 2502.16684 • Published Feb 23 • 1