Thinking with Programming Vision: Towards a Unified View for Thinking with Images Paper • 2512.03746 • Published 25 days ago • 15
Observe-R1: Unlocking Reasoning Abilities of MLLMs with Dynamic Progressive Reinforcement Learning Paper • 2505.12432 • Published May 18
APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization Paper • 2506.21655 • Published Jun 26
LLM-I: LLMs are Naturally Interleaved Multimodal Creators Paper • 2509.13642 • Published Sep 17 • 9 • 2
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition Paper • 2407.05374 • Published Jul 7, 2024
Classifier-guided Gradient Modulation for Enhanced Multimodal Learning Paper • 2411.01409 • Published Nov 3, 2024