From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model Paper • 2510.19871 • Published 8 days ago • 28 • 2
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 9 days ago • 61
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs Paper • 2510.18876 • Published 8 days ago • 35
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training Paper • 2510.15859 • Published 12 days ago • 10
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training Paper • 2510.15859 • Published 12 days ago • 10 • 2
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published 16 days ago • 160
Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling Paper • 2510.01329 • Published 28 days ago • 5 • 3
Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling Paper • 2510.01329 • Published 28 days ago • 5
Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking Paper • 2505.20199 • Published May 26 • 1
GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling Paper • 2506.22049 • Published Jun 27 • 2
Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning Paper • 2506.21285 • Published Jun 26
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization Paper • 2508.05731 • Published Aug 7 • 25
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving Paper • 2509.20109 • Published Sep 24 • 3
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving Paper • 2509.20109 • Published Sep 24 • 3