Qwen/Qwen3-VL-30B-A3B-Instruct Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 753k • • 515
Video models are zero-shot learners and reasoners Paper • 2509.20328 • Published Sep 24, 2025 • 100
meituan-longcat/LongCat-Flash-Thinking Text Generation • 562B • Updated Sep 24, 2025 • 69 • 147
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 714k • • 1.43k
ByteDance-Seed/Seed-OSS-36B-Instruct Text Generation • 36B • Updated Aug 26, 2025 • 7.89k • 477
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 181
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 465