TTS Arena
Vote on the latest TTS models!
Projects I've worked on
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Note Unofficial demo for E2/F5-TTS, which supports zero-shot voice cloning. Not affiliated with the authors of F5-TTS.
A demo of OpenDalle V1.1 on a ZERO GPU.
Note I did not create the model, just the demo.
Fast & efficient ASR outperforming Whisper!
Note Unofficial demo for the Moonshine ASR model, an efficient & fast ASR model by Useful Sensors. Moonshine ASR: https://github.com/usefulsensors/moonshine
Did StyleTTS 2 generate that audio?!?
Note A Whisper-based audio classification model for detecting audio generated by StyleTTS 2
Fast, efficient, & multilingual text-to-speech
Note Demo for MeloTTS: multilingual, multispeaker text-to-speech released under the MIT license
Generate MIDI music using RWKV v4!
Note My newest project, a demo of RWKV 4 Music (the MIDI model).
Efficient, fast, and natural text to speech with StyleTTS 2!
Note My most successful project: an online demo for StyleTTS 2. Reached HF Spaces of the Week and was the most popular Space of the Week. Note: I did not create StyleTTS 2, just the demo.
Note A multilingual dataset of text-phoneme pairs supporting 15 languages.
Obsolete, use official version instead