VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 43 • 5
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 43 • 5
Running on Zero MCP Featured 1.45k LTX Video Fast 🎥 1.45k ultra-fast video model, LTX 0.9.8 13B distilled
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 14 days ago • 161
Running on A100 Featured 1.17k LoRA the Explorer SDXL 🔎 1.17k Explore fun LoRAs and generate with SDXL