The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation Paper • 2510.23393 • Published 1 day ago • 12
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation Paper • 2510.23581 • Published 1 day ago • 39
VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting Paper • 2510.21817 • Published 7 days ago • 39
A Survey of Data Agents: Emerging Paradigm or Overstated Hype? Paper • 2510.23587 • Published 1 day ago • 51
Language Server CLI Empowers Language Agents with Process Rewards Paper • 2510.22907 • Published 2 days ago • 3
LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation Paper • 2510.22946 • Published 2 days ago • 11
LimRank: Less is More for Reasoning-Intensive Information Reranking Paper • 2510.23544 • Published 1 day ago • 7
E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker Paper • 2510.22733 • Published 2 days ago • 26
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences Paper • 2510.23451 • Published 1 day ago • 22
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 12 days ago • 38
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling Paper • 2510.20206 • Published 6 days ago • 11
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model Paper • 2510.19871 • Published 7 days ago • 28
UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning Paper • 2510.20286 • Published 6 days ago • 18