Chat with Xiaomi MiMo-Audio using voice
Generate images from scene prompts with camera parameters
A Step Towards Music Generation Foundation Model