Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
philipp-zettl 's Collections
RAG_STACK
SO-Prep
VLMs
LargeWurstModels
Chess ♟️
F(T5+1)
ToS'
summarization
good-summaries
embedding-models
llamas
not closed TTS
sd-1.5
NPC models
secret sauce FLUX
ImageNet(s)
BG-RM
OCR

VLMs

updated Sep 24
Upvote
-

  • baidu/ERNIE-4.5-VL-28B-A3B-PT

    Image-Text-to-Text • 29B • Updated Sep 1 • 69.6k • • 86

    Note apache2.0


  • Qwen/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • 4B • Updated Apr 6 • 7.71M • 545

    Note https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct/blob/main/LICENSE requires commercial license [upon request]


  • Qwen/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • 8B • Updated Apr 6 • 4.77M • • 1.33k

    Note apache2.0


  • Qwen/Qwen2-VL-2B-Instruct

    Image-Text-to-Text • 2B • Updated Jan 12 • 2.36M • 462

    Note apache2.0


  • Qwen/Qwen2-VL-7B

    Image-Text-to-Text • 8B • Updated Jan 12 • 3.54k • 61

    Note apache2.0


  • moonshotai/Kimi-VL-A3B-Thinking-2506

    Image-Text-to-Text • 16B • Updated Aug 18 • 178k • 316

    Note mit


  • vikhyatk/moondream2

    Image-Text-to-Text • 2B • Updated Sep 23 • 1.86M • 1.33k

    Note apache2.0

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs