mozilla-ai
/

whisper-small-el

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

kostissz commited on 20 days ago

Commit

cece427

·

verified ·

1 Parent(s): 57e8e75

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +20 -65

README.md CHANGED Viewed

@@ -1,80 +1,35 @@
 ---
-library_name: transformers
-license: apache-2.0
 base_model: openai/whisper-small
-tags:
-- generated_from_trainer
 datasets:
-- common_voice_17_0
-metrics:
-- wer
 model-index:
-- name: whisper-small-el
   results:
   - task:
-      name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: common_voice_17_0
-      type: common_voice_17_0
-      config: el
-      split: None
-      args: el
     metrics:
-    - name: Wer
-      type: wer
-      value: 45.63223714682724
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# whisper-small-el
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_17_0 dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.8687
-- Model Preparation Time: 0.0059
-- Wer: 45.6322
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 1e-05
-- train_batch_size: 32
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 50
-- training_steps: 14
-- mixed_precision_training: Native AMP
-### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Model Preparation Time | Wer     |
-|:-------------:|:------:|:----:|:---------------:|:----------------------:|:-------:|
-| 0.8509        | 0.0439 | 5    | 0.9019          | 0.0059                 | 46.4382 |
-| 0.8082        | 0.0877 | 10   | 0.8687          | 0.0059                 | 45.6322 |
-### Framework versions
-- Transformers 4.48.3
-- Pytorch 2.5.1+cu124
-- Datasets 3.3.1
-- Tokenizers 0.21.0

 ---
 base_model: openai/whisper-small
 datasets:
+- mozilla-foundation/common_voice_17_0
+language: el
+library_name: transformers
+license: apache-2.0
 model-index:
+- name: Finetuned openai/whisper-small on Greek
   results:
   - task:
       type: automatic-speech-recognition
+      name: Speech-to-Text
     dataset:
+      name: Common Voice (Greek)
+      type: common_voice
     metrics:
+    - type: wer
+      value: 45.632
 ---
+# Finetuned openai/whisper-small on 3620 Greek training audio samples from mozilla-foundation/common_voice_17_0.
+This model was created from the Mozilla.ai Blueprint:
+[speech-to-text-finetune](https://github.com/mozilla-ai/speech-to-text-finetune).
+## Evaluation results on 1701 audio samples of Greek:
+### Baseline model (before finetuning) on Greek
+- Word Error Rate: 46.392
+- Loss: 0.902
+### Finetuned model (after finetuning) on Greek
+- Word Error Rate: 45.632
+- Loss: 0.869