WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks Paper • 2601.02439 • Published 2 days ago
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 3 days ago • 11
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published about 14 hours ago • 16
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 1 day ago • 15
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published 3 days ago • 2
Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes Paper • 2601.02356 • Published 1 day ago • 11
VINO: A Unified Visual Generator with Interleaved OmniModal Context Paper • 2601.02358 • Published 1 day ago • 23
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 5 days ago • 45
Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking Paper • 2512.24297 • Published 8 days ago • 5
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published 8 days ago • 15
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time Paper • 2512.25075 • Published 7 days ago • 13
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Paper • 2512.24873 • Published 7 days ago • 91
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 7 days ago • 116
Bridging Your Imagination with Audio-Video Generation via a Unified Director Paper • 2512.23222 • Published 9 days ago • 5