Update README.md
Browse files
README.md
CHANGED
@@ -11,16 +11,20 @@ base_model:
|
|
11 |
pipeline_tag: automatic-speech-recognition
|
12 |
---
|
13 |
|
14 |
-
|
|
|
15 |
|
16 |
-
Dataset split:
|
17 |
-
- test -> 50% validation, 50% test
|
18 |
- Train set duration: 206 hours 43 minutes
|
19 |
- Validation set duration: 2 hours 22 minutes
|
20 |
- Test set duration: 2 hours 22 minutes
|
21 |
|
22 |
-
Results:
|
23 |
- validation WER: 4.90%
|
24 |
- validation CER: 1.78%
|
25 |
-
- test WER: 4.89
|
26 |
-
- test CER: 2.06
|
|
|
|
|
|
|
|
11 |
pipeline_tag: automatic-speech-recognition
|
12 |
---
|
13 |
|
14 |
+
## Description
|
15 |
+
Fine-tuning Whisper Large V3 Turbo on zeroth Korean dataset.
|
16 |
|
17 |
+
## Dataset split:
|
18 |
+
- The test dataset from Korean zeroth is divided to test and validation -> 50% validation, 50% test
|
19 |
- Train set duration: 206 hours 43 minutes
|
20 |
- Validation set duration: 2 hours 22 minutes
|
21 |
- Test set duration: 2 hours 22 minutes
|
22 |
|
23 |
+
## Results:
|
24 |
- validation WER: 4.90%
|
25 |
- validation CER: 1.78%
|
26 |
+
- test WER: 4.89%
|
27 |
+
- test CER: 2.06%
|
28 |
+
|
29 |
+
## Notes
|
30 |
+
- Models did not converge, better results are possible.
|