ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models Paper • 2509.21991 • Published Sep 26, 2025 • 5
ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models Paper • 2509.21991 • Published Sep 26, 2025 • 5
ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models Paper • 2509.21991 • Published Sep 26, 2025 • 5 • 2
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers Paper • 2504.00502 • Published Apr 1, 2025 • 26
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper • 2504.00557 • Published Apr 1, 2025 • 15
Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features Paper • 2504.00557 • Published Apr 1, 2025 • 15