Update README.md
Browse files
README.md
CHANGED
@@ -7,6 +7,8 @@ library_name: onnx
|
|
7 |
tags:
|
8 |
- piper
|
9 |
- tts
|
|
|
|
|
10 |
---
|
11 |
|
12 |
## ⚠️ Initial Checkpoint
|
@@ -21,7 +23,7 @@ This model is after just 5 epochs on ~30% of total data I curated (synthetic + n
|
|
21 |
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/659be8bbb0f43ed69f17e7b8/8bRegOeimX1A6VCyjQUW-.wav"></audio>
|
22 |
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/659be8bbb0f43ed69f17e7b8/-ww0FdqtPPnTwZ2Kasl54.wav"></audio>
|
23 |
|
24 |
-
Currently, I'm refining the dataset as I'm not satisfied with its quality. I will resume finetuning after.
|
25 |
|
26 |
Also running ablations on the best ratio of synthetic and natural data.
|
27 |
|
@@ -31,6 +33,8 @@ Trying to push the boundaries of audio generated by a mere 63 MB model.
|
|
31 |
|
32 |
## 🙏 Acknowledgements
|
33 |
|
34 |
-
[Bryce Beattie](https://brycebeattie.com/files/tts/)
|
|
|
|
|
35 |
|
36 |
[Piper TTS](https://github.com/rhasspy/piper)
|
|
|
7 |
tags:
|
8 |
- piper
|
9 |
- tts
|
10 |
+
datasets:
|
11 |
+
- Jinsaryko/Elise
|
12 |
---
|
13 |
|
14 |
## ⚠️ Initial Checkpoint
|
|
|
23 |
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/659be8bbb0f43ed69f17e7b8/8bRegOeimX1A6VCyjQUW-.wav"></audio>
|
24 |
<audio controls src="https://cdn-uploads.huggingface.co/production/uploads/659be8bbb0f43ed69f17e7b8/-ww0FdqtPPnTwZ2Kasl54.wav"></audio>
|
25 |
|
26 |
+
Currently, I'm refining the synthetic dataset as I'm not satisfied with its quality. I will resume finetuning after.
|
27 |
|
28 |
Also running ablations on the best ratio of synthetic and natural data.
|
29 |
|
|
|
33 |
|
34 |
## 🙏 Acknowledgements
|
35 |
|
36 |
+
[Bryce Beattie](https://brycebeattie.com/files/tts/) for training the Kristin model.
|
37 |
+
|
38 |
+
Reference Audio from datasets by [@Jinsaryko](https://huggingface.co/Jinsaryko)
|
39 |
|
40 |
[Piper TTS](https://github.com/rhasspy/piper)
|