mrfakename 
posted an update May 2
Hi everyone,

I just launched TTS Arena V2 - a platform for benchmarking TTS models by blind A/B testing. The goal is to make it easy to compare quality between open-source and commercial models, including conversational ones.
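For readers wondering how blind A/B votes can become a ranking, here is a minimal sketch. It is illustrative only and not the Arena's actual code: the function names `blind_pair` and `win_rates`, the placeholder model names, and the choice of a plain win rate as the score are all assumptions.

```python
import random
from collections import defaultdict


def blind_pair(models, rng):
    """Pick two distinct models in random order, so the voter
    cannot infer which model is behind sample A or sample B."""
    return tuple(rng.sample(models, 2))


def win_rates(votes):
    """Aggregate (winner, loser) votes into a win rate per model."""
    wins = defaultdict(int)
    games = defaultdict(int)
    for winner, loser in votes:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    return {model: wins[model] / games[model] for model in games}


# Hypothetical model names and votes, for illustration only.
models = ["model_a", "model_b", "model_c"]
pair = blind_pair(models, random.Random(0))

votes = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_b", "model_c"),
    ("model_c", "model_a"),
]
rates = win_rates(votes)
leaderboard = sorted(rates.items(), key=lambda item: item[1], reverse=True)
```

A plain win rate ignores how strong each opponent was; a rating system that accounts for opponent strength would be a natural refinement, but that is beyond this sketch.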

What's new in V2:

- **Conversational Arena**: Evaluate models like CSM-1B, Dia 1.6B, and PlayDialog in multi-turn settings
- **Personal Leaderboard**: Optional login to see which models you tend to prefer
- **Multi-speaker TTS**: Random voices per generation to reduce speaker bias
- **Performance Upgrade**: Rebuilt the app, moving from Gradio to Flask. Much faster, with fewer failed generations.
- **Keyboard Shortcuts**: Vote entirely via keyboard

Also added models like MegaTTS 3, Cartesia Sonic, and ElevenLabs' full lineup.

I'd love any feedback, feature suggestions, or ideas for models to include.

TTS-AGI/TTS-Arena-V2

Awesome!

Hello, the E2-F5-TTS model is not working. Please fix the issue as soon as possible.

Question: Is there any code we can insert for pauses?

Suggestion: Put in some kind of mechanism for tone adjustment

Unfortunately, not all models support this, so I'm not sure this would be feasible.

What is "LJSpeech," and why is there only one voice to use?

I’m looking for the open-source code for the TTS model. Here’s the link to the Hugging Face Space: https://huggingface.co/spaces/mrfakename/E2-F5-TTS