Real-time video captioning powered by FastVLM
In-browser unified multimodal understanding and generation.