DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval Paper • 2506.08887 • Published Jun 10, 2025 • 4
FastVID: Dynamic Density Pruning for Fast Video Large Language Models Paper • 2503.11187 • Published Mar 14, 2025 • 1
TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval Paper • 2409.01156 • Published Sep 2, 2024