InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published 16 days ago • 94 • 4
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space Paper • 2512.24617 • Published 4 days ago • 38 • 4
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published 11 days ago • 24 • 4
Leveraging LLMs for Legacy Code Modernization: Challenges and Opportunities for LLM-Generated Documentation Paper • 2411.14971 • Published Nov 22, 2024 • 1
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 6 days ago • 87 • 4
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 87 • 4
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1, 2025 • 109 • 8
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published 12 days ago • 48 • 4
Schoenfeld's Anatomy of Mathematical Reasoning by Language Models Paper • 2512.19995 • Published 12 days ago • 14 • 5
Spatia: Video Generation with Updatable Spatial Memory Paper • 2512.15716 • Published 18 days ago • 28 • 4
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 12 days ago • 59 • 5
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published 17 days ago • 109 • 9
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 182 • 7
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 17 days ago • 82 • 4
WorldGen: From Text to Traversable and Interactive 3D Worlds Paper • 2511.16825 • Published Nov 20, 2025 • 23 • 4