Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published about 13 hours ago • 22
Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek Paper • 2601.15100 • Published about 16 hours ago
Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning Paper • 2601.14750 • Published about 24 hours ago • 9
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models Paper • 2601.10387 • Published 7 days ago • 9
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 6 days ago • 17
AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems Paper • 2601.11354 • Published 6 days ago • 3
BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search Paper • 2601.11037 • Published 6 days ago • 15
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 7 days ago • 31
FlowAct-R1: Towards Interactive Humanoid Video Generation Paper • 2601.10103 • Published 7 days ago • 29
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published 7 days ago • 11
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 7 days ago • 25
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published 8 days ago • 24
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines Paper • 2601.09465 • Published 8 days ago • 39