IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System 🎙 Generate audio from text using a reference audio sample
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation • Paper • 2407.04822 • Published Jul 5, 2024
Qwen2.5 Omni 7B Demo 🏆 Generate text and speech responses from text, images, or audio input