VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs • Paper • 2509.25916 • Published Sep 30, 2025
RLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards • Paper • 2509.21319 • Published Sep 25, 2025
Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say • Paper • 2509.21164 • Published Sep 25, 2025
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation • Paper • 2504.02782 • Published Apr 3, 2025
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models • Paper • 2504.15279 • Published Apr 21, 2025
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation • Paper • 2505.13215 • Published May 19, 2025
LightLab: Controlling Light Sources in Images with Diffusion Models • Paper • 2505.09608 • Published May 14, 2025
Improving Editability in Image Generation with Layer-wise Memory • Paper • 2505.01079 • Published May 2, 2025
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models • Paper • 2505.02735 • Published May 5, 2025
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale • Paper • 2505.03005 • Published May 5, 2025
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models • Paper • 2505.14810 • Published May 20, 2025