license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- facebook/voxpopuli
- mrfakename/librivox-full-catalog-archive
language:
- fi
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
Here are three Finnish models of the F5-TTS, listen speech samples for models.
Numbers cannot be understood by models. Convert numbers to words.
The Common Voice and Vox Populi Finnish datasets are used for the first round.
20241206 (v0)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":
Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors
Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/vocab.txt
The second round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.
20241217 (v0)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":
Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/model_last_20241217.safetensors
Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/vocab.txt
The third round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.
20250323 (v1)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":
Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/model_last_20250323.safetensors
Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/vocab.txt
There is example script in that directory: CLI_Example_Generating_Audio.txt