Edit and enhance images based on descriptive instructions
Transcribe audio files or YouTube videos into text