@mrfakename on Hugging Face: "Hi everyone, I just launched TTS Arena V2

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

mrfakename

posted an update May 2

Post

8076

Hi everyone,

I just launched TTS Arena V2 - a platform for benchmarking TTS models by blind A/B testing. The goal is to make it easy to compare quality between open-source and commercial models, including conversational ones.

What's new in V2:

- **Conversational Arena**: Evaluate models like CSM-1B, Dia 1.6B, and PlayDialog in multi-turn settings
- **Personal Leaderboard**: Optional login to see which models you tend to prefer
- **Multi-speaker TTS**: Random voices per generation to reduce speaker bias
- **Performance Upgrade**: Rebuilt from Gradio → Flask. Much faster with fewer failed generations.
- **Keyboard Shortcuts**: Vote entirely via keyboard

Also added models like MegaTTS 3, Cartesia Sonic, and ElevenLabs' full lineup.

I'd love any feedback, feature suggestions, or ideas for models to include.

TTS-AGI/TTS-Arena-V2

9voltfan2009

May 2

Awesome!

zeeshiii05

May 22

Hello Dear, (E2-F5-TTS) This Model is not working please solve issues as soon as possible

Pmfried

May 29

*Question...........Is there any coding we can insert for pauses?

Suggestion: Put in some kind of mechanism for tone adjustment

mrfakename

May 30

Unfortunately not all models support this so I'm not sure this would be feasbile

Pmfried

Jul 21

What is "LJSpeech," and why is there only one voice to use??

hasnain213

Aug 12

I’m looking for the open-source code for the TTS model. Here’s the link to the Hugging Face Space: https://huggingface.co/spaces/mrfakename/E2-F5-TTS"

In this post