Spaces:
Running
Running
RASPIAUDIO
commited on
Commit
·
45d7137
1
Parent(s):
274613e
french version 1.01
Browse files
app.py
CHANGED
@@ -753,11 +753,38 @@ with gr.Blocks() as app:
|
|
753 |
This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
|
754 |
|
755 |
* [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
|
756 |
-
* [E2 TTS](https://arxiv.org/abs/2406.18009) (Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS)
|
757 |
|
758 |
-
The checkpoint support French
|
759 |
|
760 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
761 |
|
762 |
If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
|
763 |
|
|
|
753 |
This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
|
754 |
|
755 |
* [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
|
|
|
756 |
|
|
|
757 |
|
758 |
+
# F5 TTS French Model
|
759 |
+
|
760 |
+
This model was trained exclusively on French public domain material from [LibriVox](https://librivox.org/) and voice recordings. It is designed for text-to-speech synthesis in French and has gone through two rounds of training:
|
761 |
+
|
762 |
+
1. **Round 1**:
|
763 |
+
- **Samples**: 80k
|
764 |
+
- **Epochs**: 100
|
765 |
+
- **Speakers**: Single speaker
|
766 |
+
|
767 |
+
2. **Round 2**:
|
768 |
+
- **Samples**: 120k
|
769 |
+
- **Epochs**: 150
|
770 |
+
- **Speakers**: Multiple speakers from different corpus
|
771 |
+
|
772 |
+
### Model Checkpoint
|
773 |
+
The checkpoint supports French and can be [downloaded here](https://huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced).
|
774 |
+
|
775 |
+
### Online Inference
|
776 |
+
You can test the model online using our [Hugging Face Space](https://huggingface.co/spaces/RASPIAUDIO/f5-tts_french).
|
777 |
+
|
778 |
+
### Resources
|
779 |
+
- [Tutorial video to train a new language model](https://www.youtube.com/watch?v=UO4usaOojys)
|
780 |
+
- [Discussion about this training can be found here](https://github.com/SWivid/F5-TTS/issues/434)
|
781 |
+
|
782 |
+
### About Us
|
783 |
+
Check out our website: [https://raspiaudio.com](https://raspiaudio.com) – We create cool, hackable audio devices that let you experiment with audio, voice assistants, and AI using ESP32 and Raspberry Pi!
|
784 |
+
|
785 |
+
|
786 |
+
|
787 |
+
Generations using CPU takes a very long time on the free CPU HF machine so I recommend to upgrade to any GPU machine
|
788 |
|
789 |
If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
|
790 |
|