PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
Real-time video captioning powered by FastVLM