view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 21 days ago • 105
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published 27 days ago • 37
view article Article Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model 6 days ago • 15
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation • 15B • Updated Aug 27, 2025 • 4.61k • 104
SpecBundle Collection A collection of production-grade draft models for speculative decoding • 14 items • Updated 15 days ago • 13
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 23 days ago • 73
Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation Paper • 2512.16913 • Published 20 days ago • 33