RASPIAUDIO committed on
Commit
45d7137
·
1 Parent(s): 274613e

french version 1.01

Files changed (1)
  1. app.py +30 -3
app.py CHANGED
@@ -753,11 +753,38 @@ with gr.Blocks() as app:
753
  This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
754
 
755
  * [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
756
- * [E2 TTS](https://arxiv.org/abs/2406.18009) (Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS)
757
 
758
- The checkpoint support French
759
 
760
- Generations using CPU takes usually 2-3 minutes
761
 
762
  If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
763
 
 
753
  This is a local web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
754
 
755
  * [F5-TTS](https://arxiv.org/abs/2410.06885) (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
 
756
 
 
757
 
758
+ # F5 TTS French Model
759
+
760
+ This model was trained exclusively on French public domain material from [LibriVox](https://librivox.org/) and voice recordings. It is designed for text-to-speech synthesis in French and has gone through two rounds of training:
761
+
762
+ 1. **Round 1**:
763
+ - **Samples**: 80k
764
+ - **Epochs**: 100
765
+ - **Speakers**: Single speaker
766
+
767
+ 2. **Round 2**:
768
+ - **Samples**: 120k
769
+ - **Epochs**: 150
770
+ - **Speakers**: Multiple speakers from different corpora
771
+
772
+ ### Model Checkpoint
773
+ The checkpoint supports French and can be [downloaded here](https://huggingface.co/RASPIAUDIO/F5-French-MixedSpeakers-reduced).
774
+
775
+ ### Online Inference
776
+ You can test the model online using our [Hugging Face Space](https://huggingface.co/spaces/RASPIAUDIO/f5-tts_french).
777
+
778
+ ### Resources
779
+ - [Tutorial video to train a new language model](https://www.youtube.com/watch?v=UO4usaOojys)
780
+ - [Discussion about this training can be found here](https://github.com/SWivid/F5-TTS/issues/434)
781
+
782
+ ### About Us
783
+ Check out our website: [https://raspiaudio.com](https://raspiaudio.com) – We create cool, hackable audio devices that let you experiment with audio, voice assistants, and AI using ESP32 and Raspberry Pi!
784
+
785
+
786
+
787
+ Generation takes a very long time on the free Hugging Face CPU machine, so I recommend upgrading to a GPU machine.
788
 
789
  If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 15s, and shortening your prompt.
790
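The troubleshooting tip in the description (convert the reference audio to WAV and clip it to 15s) could be applied before uploading with something like the sketch below. It is not part of this commit or of `app.py`: the helper name `clip_wav` and its parameters are hypothetical, and it assumes the reference audio is already an uncompressed PCM WAV file, since Python's standard `wave` module does not read MP3.

```python
import wave


def clip_wav(src_path, dst_path, max_seconds=15.0):
    """Copy a PCM WAV file, keeping at most max_seconds of audio.

    Hypothetical helper for illustration; returns the number of frames written.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        # Number of frames that corresponds to the requested duration.
        max_frames = int(params.framerate * max_seconds)
        frames = src.readframes(min(params.nframes, max_frames))

    with wave.open(dst_path, "wb") as dst:
        # Reuse channels, sample width, and sample rate from the source;
        # the frame count in the header is corrected when the file is closed.
        dst.setparams(params)
        dst.writeframes(frames)

    return len(frames) // (params.sampwidth * params.nchannels)
```

For example, `clip_wav("reference.wav", "reference_15s.wav")` would write a copy no longer than 15 seconds; files that are already shorter are copied unchanged.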