---
base_model: openai/whisper-large-v3-turbo
datasets:
- fi
language: fi
library_name: transformers
license: apache-2.0
model-index:
- name: Finetuned openai/whisper-large-v3-turbo on Finnish
  results:
  - task:
      type: automatic-speech-recognition
      name: Speech-to-Text
    dataset:
      name: Common Voice (Finnish)
      type: common_voice
    metrics:
    - type: wer
      value: 11.972
---

# Finetuned openai/whisper-large-v3-turbo on 2093 Finnish training audio samples from /home/kostis/cv-corpus-22.0-2025-06-20/fi

This model was created from the Mozilla.ai Blueprint:
[speech-to-text-finetune](https://github.com/mozilla-ai/speech-to-text-finetune).
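
A minimal usage sketch, assuming the checkpoint is loaded through the `transformers` automatic-speech-recognition pipeline. `MODEL_ID` is a hypothetical placeholder (this card does not state where the checkpoint is hosted), and the `transcribe` helper is introduced here for illustration only.

```python
# Hypothetical placeholder: replace with the Hub ID or local directory of this checkpoint.
MODEL_ID = "path/to/this-finetuned-model"

# Whisper generation options: request Finnish transcription instead of
# relying on automatic language detection.
GENERATE_KWARGS = {"language": "finnish", "task": "transcribe"}


def transcribe(audio_path, model_id=MODEL_ID):
    """Return the transcript of a single audio file as a string."""
    # Imported lazily so that defining this helper does not load transformers.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path, generate_kwargs=GENERATE_KWARGS)
    return result["text"]
```

Calling `transcribe("sample.wav")` with an audio file of your own (the file name is illustrative) should return the Finnish transcript as plain text. Decoding a file path this way relies on `ffmpeg` being available on the system.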
## Evaluation results on 1806 audio samples of Finnish:

### Baseline model (before finetuning) on Finnish

- Word Error Rate (Normalized): 16.576
- Word Error Rate (Orthographic): 20.298
- Character Error Rate (Normalized): 3.808
- Character Error Rate (Orthographic): 4.498
- Loss: 2.112
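
For readers unfamiliar with these metrics, the sketch below shows how word and character error rates are commonly defined: the edit distance between reference and hypothesis, divided by the reference length, expressed as a percentage. The `normalize` function is an assumption made for illustration (lowercasing and stripping punctuation); the exact normalizer behind the "Normalized" figures is not stated in this card and may differ.

```python
import re


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (lists of words or strings)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalize(text):
    # Assumption: "normalized" means lowercased with punctuation removed.
    text = re.sub(r"[^\w\s]", "", text.lower())
    return " ".join(text.split())


def wer(reference, hypothesis):
    """Word error rate in percent for one reference/hypothesis pair."""
    ref_words = reference.split()
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate in percent for one reference/hypothesis pair."""
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)


reference = "Hyvää huomenta, Suomi!"
hypothesis = "hyvää huomenta suomi"

orthographic_wer = wer(reference, hypothesis)
normalized_wer = wer(normalize(reference), normalize(hypothesis))
```

In this toy pair the transcript differs from the reference only in casing and punctuation, so the orthographic WER is 100.0 while the normalized WER is 0.0. This is why the orthographic figures in this card are consistently higher than the normalized ones.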
### Finetuned model (after finetuning) on Finnish

- Word Error Rate (Normalized): 11.972
- Word Error Rate (Orthographic): 15.884
- Character Error Rate (Normalized): 2.255
- Character Error Rate (Orthographic): 3.119
- Loss: 0.172
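
To put the two result sets side by side, the following sketch computes the relative error reduction from baseline to finetuned model, using only the figures reported above. The dictionary keys and the `relative_reduction` helper are names chosen here for illustration.

```python
# Error rates (in percent) copied from the evaluation results in this card.
baseline = {
    "wer_normalized": 16.576,
    "wer_orthographic": 20.298,
    "cer_normalized": 3.808,
    "cer_orthographic": 4.498,
}
finetuned = {
    "wer_normalized": 11.972,
    "wer_orthographic": 15.884,
    "cer_normalized": 2.255,
    "cer_orthographic": 3.119,
}


def relative_reduction(before, after):
    """Relative reduction in percent, measured against the baseline value."""
    return 100.0 * (before - after) / before


reductions = {
    name: round(relative_reduction(baseline[name], finetuned[name]), 1)
    for name in baseline
}

for name, value in reductions.items():
    print(f"{name}: {baseline[name]:.3f} -> {finetuned[name]:.3f} "
          f"({value:.1f}% relative reduction)")
```

On these numbers, finetuning reduces normalized WER by about 27.8% and normalized CER by about 40.8% relative to the baseline; the orthographic figures improve by about 21.7% (WER) and 30.7% (CER).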