README.md · AsmoKoskinen/F5-TTS_Finnish

metadata

license: cc-by-nc-4.0
datasets:
  - mozilla-foundation/common_voice_17_0
  - facebook/voxpopuli
  - mrfakename/librivox-full-catalog-archive
language:
  - fi
base_model:
  - SWivid/F5-TTS
pipeline_tag: text-to-speech

Here are three Finnish models of the F5-TTS, listen speech samples for models.

Numbers cannot be understood by models. Convert numbers to words.

The Common Voice and Vox Populi Finnish datasets are used for the first round.

20241206 (v0)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/vocab.txt

The second round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.

20241217 (v0)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/model_last_20241217.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/vocab.txt

The third round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.

20250323 (v1)
Speakers: Several speakers from different corpus
Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/model_last_20250323.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/vocab.txt

There is example script in that directory: CLI_Example_Generating_Audio.txt