FastVLM Collection Efficient Vision Encoding for Vision Language Models β’ 9 items β’ Updated Sep 2, 2025 β’ 106
MobileCLIP2 Collection MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B β’ 37 items β’ Updated Sep 18, 2025 β’ 57
google/siglip-so400m-patch14-384 Zero-Shot Image Classification β’ 0.9B β’ Updated Sep 26, 2024 β’ 6.14M β’ 634
Nomic Embed Vision Collection Vision Encoders aligned to Nomic Embed Text making Nomic Embed multimodal! β’ 2 items β’ Updated Jun 5, 2024 β’ 10
nomic-ai/nomic-embed-vision-v1.5 Image Feature Extraction β’ 92.9M β’ Updated Mar 31, 2025 β’ 100k β’ 204
Running Featured 558 Vision Arena (Testing VLMs side-by-side) πΌ 558 Display image analysis results
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation β’ 33B β’ Updated Jan 12, 2025 β’ 182k β’ β’ 1.96k