Plan-X: Instruct Video Generation via Semantic Planning Paper • 2511.17986 • Published Nov 22, 2025 • 17
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models Paper • 2507.23682 • Published Jul 31, 2025 • 23
Long-Video Audio Synthesis with Multi-Agent Collaboration Paper • 2503.10719 • Published Mar 13, 2025 • 9