Taming generative video models for zero-shot optical flow extraction Paper • 2507.09082 • Published Jul 11, 2025 • 12
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025 • 27
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025 • 27
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos Paper • 2411.11409 • Published Nov 18, 2024
Yunong/llama-openpi_v2_llm_tasks_fill_with_other_steps_lora Text Generation • Updated Jan 11, 2024 • 7
Yunong/llama-13b-openpi_v2_llm_tasks_fill_with_other_steps Text Generation • Updated Jan 10, 2024 • 5
Yunong/mistral-openpi_v2_llm_tasks_fill_with_other_steps Text Generation • 7B • Updated Jan 6, 2024 • 5