VoxCPM
Long-form multi-speaker dialogue generation
Kontext image editing on FLUX[dev]
Generate custom songs from lyrics and prompts
Generate speech from text