microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 179k • 1.56k
nomic-ai/colnomic-embed-multimodal-7b Visual Document Retrieval • Updated Apr 15, 2025 • 3.06k • 99
Qwen/Qwen2.5-VL-32B-Instruct Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 2.34M • • 474
moondream/moondream-2b-2025-04-14-4bit Image-Text-to-Text • 1B • Updated May 22, 2025 • 3.55k • 61
Alibaba-NLP/gme-Qwen2-VL-2B-Instruct Sentence Similarity • 2B • Updated Jun 9, 2025 • 32.5k • 130