Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Panoramic X-ray Analysis Paper • 2509.09254 • Published Sep 11, 2025 • 6
Visual Position Prompt for MLLM based Visual Grounding Paper • 2503.15426 • Published Mar 19, 2025 • 2
MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams Paper • 2503.20745 • Published Mar 26, 2025 • 1
Artemis: Structured Visual Reasoning for Perception Policy Learning Paper • 2512.01988 • Published Dec 1, 2025 • 1
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory Paper • 2511.21678 • Published Nov 26, 2025 • 12
Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception Paper • 2412.14233 • Published Dec 18, 2024 • 6