LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper β’ 2512.13604 β’ Published 23 days ago β’ 73
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper β’ 2512.08269 β’ Published 30 days ago β’ 116
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper β’ 2512.00425 β’ Published Nov 29, 2025 β’ 50
Guided Self-Evolving LLMs with Minimal Human Supervision Paper β’ 2512.02472 β’ Published Dec 2, 2025 β’ 51
How Far Are We from Genuinely Useful Deep Research Agents? Paper β’ 2512.01948 β’ Published Dec 1, 2025 β’ 54
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published Dec 1, 2025 β’ 71
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper β’ 2511.22570 β’ Published Nov 27, 2025 β’ 86
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper β’ 2511.21689 β’ Published Nov 26, 2025 β’ 112
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper β’ 2512.04324 β’ Published Dec 3, 2025 β’ 150
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper β’ 2511.20785 β’ Published Nov 25, 2025 β’ 182
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper β’ 2512.04677 β’ Published Dec 4, 2025 β’ 167
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper β’ 2511.22699 β’ Published Nov 27, 2025 β’ 224
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper β’ 2512.02556 β’ Published Dec 2, 2025 β’ 245
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper β’ 2512.05965 β’ Published Dec 5, 2025 β’ 38
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper β’ 2512.07831 β’ Published about 1 month ago β’ 16