Text-to-Speech
Finnish
File size: 1,726 Bytes
36e015a
 
 
 
 
122c149
36e015a
 
 
 
f817a3b
2ab0b09
 
62ba1d7
e0e22e1
8a137ed
 
8b9bf83
e0e22e1
 
 
ffb8ed5
e0e22e1
8486f2f
e0e22e1
78761f0
 
 
104e79a
78761f0
 
8b9bf83
e0e22e1
62ba1d7
e0e22e1
ffb8ed5
fafc1c5
8486f2f
77d14db
78761f0
 
 
104e79a
78761f0
 
62ba1d7
 
4f2ecee
62ba1d7
ffb8ed5
62ba1d7
 
 
 
 
 
 
 
 
d904675
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
---
license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- facebook/voxpopuli
- mrfakename/librivox-full-catalog-archive
language:
- fi
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
---

Here are three Finnish models of the F5-TTS, listen speech samples for models.

Numbers cannot be understood by models. Convert numbers to words.

--- --- ---

The Common Voice and Vox Populi Finnish datasets are used for the first round.

- 20241206 (v0)

- Speakers: Several speakers from different corpus

- Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/vocab.txt

--- --- ---

The second round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.

- 20241217 (v0)

- Speakers: Several speakers from different corpus

- Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/model_last_20241217.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/vocab.txt

--- --- ---

The third round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets.

- 20250323 (v1)

- Speakers: Several speakers from different corpus

- Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/model_last_20250323.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20250323/vocab.txt

There is example script in that directory: CLI_Example_Generating_Audio.txt