Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Paper • 2510.05034 • Published Oct 6, 2025 • 50
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26, 2025 • 185
A Survey on Video Temporal Grounding with Multimodal Large Language Model Paper • 2508.10922 • Published Aug 7, 2025 • 1
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 24 days ago • 553