Transcribe or translate audio from microphone, file, or YouTube
High-fidelity Text-To-Speech
Generate music from text prompts