UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation Paper • 2510.18701 • Published 12 days ago • 66
VPPO Model Collection SOTA models for multimodal reasoning, fine-tuned with VPPO. Achieves superior performance by focusing on critical visual tokens. • 3 items • Updated 20 days ago • 3
Spotlight on Token Perception for Multimodal Reinforcement Learning Paper • 2510.09285 • Published 23 days ago • 35 • 3
Spotlight on Token Perception for Multimodal Reinforcement Learning Paper • 2510.09285 • Published 23 days ago • 35 • 3
Spotlight on Token Perception for Multimodal Reinforcement Learning Paper • 2510.09285 • Published 23 days ago • 35
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published 24 days ago • 68
Diversity-Incentivized Exploration for Versatile Reasoning Paper • 2509.26209 • Published Sep 30 • 16
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 27 days ago • 462
GRACE: Generative Representation Learning via Contrastive Policy Optimization Paper • 2510.04506 • Published 27 days ago • 10
FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting Paper • 2509.24304 • Published Sep 29 • 4 • 3
FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting Paper • 2509.24304 • Published Sep 29 • 4
FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting Paper • 2509.24304 • Published Sep 29 • 4 • 3
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data Paper • 2509.15221 • Published Sep 18 • 109
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration Paper • 2509.14760 • Published Sep 18 • 52
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning Paper • 2509.09674 • Published Sep 11 • 78
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 185