End of training

Browse files

Files changed (5) hide show

README.md +127 -195
adapter.ibo.safetensors +3 -0
emissions.csv +2 -0
model.safetensors +1 -1
runs/Jul29_22-12-51_b16dc27cf29a/events.out.tfevents.1753827625.b16dc27cf29a.19645.0 +2 -2

README.md CHANGED Viewed

@@ -1,199 +1,131 @@
 ---
 library_name: transformers
-tags: []
 ---
-# Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-## Model Details
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
-## Uses
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
-### Direct Use
-<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-[More Information Needed]
-### Downstream Use [optional]
-<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
-[More Information Needed]
-### Out-of-Scope Use
-<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
-[More Information Needed]
-## Bias, Risks, and Limitations
-<!-- This section is meant to convey both technical and sociotechnical limitations. -->
-[More Information Needed]
-### Recommendations
-<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
-Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
-## How to Get Started with the Model
-Use the code below to get started with the model.
-[More Information Needed]
-## Training Details
-### Training Data
-<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[More Information Needed]
-### Training Procedure
-<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
-#### Preprocessing [optional]
-[More Information Needed]
-#### Training Hyperparameters
-- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
-#### Speeds, Sizes, Times [optional]
-<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
-[More Information Needed]
-## Evaluation
-<!-- This section describes the evaluation protocols and provides the results. -->
-### Testing Data, Factors & Metrics
-#### Testing Data
-<!-- This should link to a Dataset Card if possible. -->
-[More Information Needed]
-#### Factors
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-[More Information Needed]
-#### Metrics
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-[More Information Needed]
-### Results
-[More Information Needed]
-#### Summary
-## Model Examination [optional]
-<!-- Relevant interpretability work for the model goes here -->
-[More Information Needed]
-## Environmental Impact
-<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
-Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
-- **Hardware Type:** [More Information Needed]
-- **Hours used:** [More Information Needed]
-- **Cloud Provider:** [More Information Needed]
-- **Compute Region:** [More Information Needed]
-- **Carbon Emitted:** [More Information Needed]
-## Technical Specifications [optional]
-### Model Architecture and Objective
-[More Information Needed]
-### Compute Infrastructure
-[More Information Needed]
-#### Hardware
-[More Information Needed]
-#### Software
-[More Information Needed]
-## Citation [optional]
-<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
-**BibTeX:**
-[More Information Needed]
-**APA:**
-[More Information Needed]
-## Glossary [optional]
-<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
-[More Information Needed]
-## More Information [optional]
-[More Information Needed]
-## Model Card Authors [optional]
-[More Information Needed]
-## Model Card Contact
-[More Information Needed]

 ---
 library_name: transformers
+license: cc-by-nc-4.0
+base_model: facebook/mms-1b-all
+tags:
+- generated_from_trainer
+datasets:
+- fleurs
+metrics:
+- wer
+model-index:
+- name: wav2vec2-large-mms-1b-igbo
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: fleurs
+      type: fleurs
+      config: ig_ng
+      split: test
+      args: ig_ng
+    metrics:
+    - name: Wer
+      type: wer
+      value: 0.444900640499261
 ---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# wav2vec2-large-mms-1b-igbo
+This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on the fleurs dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.4649
+- Wer: 0.4449
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.001
+- train_batch_size: 4
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 4
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step  | Validation Loss | Wer    |
+|:-------------:|:------:|:-----:|:---------------:|:------:|
+| 0.464         | 0.0731 | 1000  | 0.7265          | 0.5768 |
+| 0.4324        | 0.1463 | 2000  | 0.7455          | 0.6102 |
+| 0.4307        | 0.2194 | 3000  | 1.1129          | 0.6445 |
+| 0.3982        | 0.2925 | 4000  | 0.7999          | 0.5870 |
+| 0.3915        | 0.3657 | 5000  | 0.7252          | 0.5210 |
+| 0.3834        | 0.4388 | 6000  | 0.7565          | 0.5677 |
+| 0.376         | 0.5120 | 7000  | 0.7596          | 0.6294 |
+| 0.388         | 0.5851 | 8000  | 0.6784          | 0.5679 |
+| 0.3687        | 0.6582 | 9000  | 0.7597          | 0.5916 |
+| 0.374         | 0.7314 | 10000 | 0.6482          | 0.5023 |
+| 0.3576        | 0.8045 | 11000 | 0.6486          | 0.5572 |
+| 0.3621        | 0.8776 | 12000 | 0.5482          | 0.4869 |
+| 0.363         | 0.9508 | 13000 | 0.6543          | 0.5082 |
+| 0.3549        | 1.0239 | 14000 | 0.5477          | 0.4849 |
+| 0.342         | 1.0971 | 15000 | 0.5505          | 0.5079 |
+| 0.3296        | 1.1702 | 16000 | 0.5701          | 0.5211 |
+| 0.3363        | 1.2433 | 17000 | 0.5565          | 0.5281 |
+| 0.3265        | 1.3165 | 18000 | 0.6660          | 0.5794 |
+| 0.327         | 1.3896 | 19000 | 0.5414          | 0.4854 |
+| 0.3319        | 1.4627 | 20000 | 0.5677          | 0.5181 |
+| 0.3273        | 1.5359 | 21000 | 0.5482          | 0.4901 |
+| 0.3209        | 1.6090 | 22000 | 0.5475          | 0.5019 |
+| 0.3153        | 1.6821 | 23000 | 0.5278          | 0.4723 |
+| 0.3214        | 1.7553 | 24000 | 0.5232          | 0.4809 |
+| 0.3227        | 1.8284 | 25000 | 0.5419          | 0.4950 |
+| 0.306         | 1.9016 | 26000 | 0.5120          | 0.4653 |
+| 0.2956        | 1.9747 | 27000 | 0.5043          | 0.4790 |
+| 0.2875        | 2.0478 | 28000 | 0.5111          | 0.4592 |
+| 0.3158        | 2.1210 | 29000 | 0.4959          | 0.4582 |
+| 0.2906        | 2.1941 | 30000 | 0.4857          | 0.4577 |
+| 0.2985        | 2.2672 | 31000 | 0.4897          | 0.4625 |
+| 0.2877        | 2.3404 | 32000 | 0.4869          | 0.4667 |
+| 0.2832        | 2.4135 | 33000 | 0.4877          | 0.4541 |
+| 0.2815        | 2.4867 | 34000 | 0.4869          | 0.4598 |
+| 0.28          | 2.5598 | 35000 | 0.4935          | 0.4624 |
+| 0.2904        | 2.6329 | 36000 | 0.4859          | 0.4540 |
+| 0.2767        | 2.7061 | 37000 | 0.4879          | 0.4550 |
+| 0.2801        | 2.7792 | 38000 | 0.4855          | 0.4536 |
+| 0.2711        | 2.8523 | 39000 | 0.5059          | 0.4674 |
+| 0.2652        | 2.9255 | 40000 | 0.4715          | 0.4512 |
+| 0.276         | 2.9986 | 41000 | 0.4804          | 0.4568 |
+| 0.2556        | 3.0717 | 42000 | 0.4869          | 0.4572 |
+| 0.275         | 3.1449 | 43000 | 0.4761          | 0.4536 |
+| 0.2615        | 3.2180 | 44000 | 0.4848          | 0.4679 |
+| 0.264         | 3.2912 | 45000 | 0.4722          | 0.4518 |
+| 0.2554        | 3.3643 | 46000 | 0.4747          | 0.4551 |
+| 0.2632        | 3.4374 | 47000 | 0.4695          | 0.4507 |
+| 0.2565        | 3.5106 | 48000 | 0.4761          | 0.4506 |
+| 0.2555        | 3.5837 | 49000 | 0.4802          | 0.4619 |
+| 0.2397        | 3.6568 | 50000 | 0.4687          | 0.4497 |
+| 0.2599        | 3.7300 | 51000 | 0.4684          | 0.4506 |
+| 0.2451        | 3.8031 | 52000 | 0.4678          | 0.4504 |
+| 0.2623        | 3.8763 | 53000 | 0.4642          | 0.4461 |
+| 0.2475        | 3.9494 | 54000 | 0.4649          | 0.4449 |
+### Framework versions
+- Transformers 4.54.0.dev0
+- Pytorch 2.5.1+cu121
+- Datasets 3.2.0
+- Tokenizers 0.21.0

adapter.ibo.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:49659695f3426fbd93462d97eb5f6d633bc6d3bf44967d0dd2fd7d78f9676dd5
+size 9152120

emissions.csv ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2	+ 2025-07-30T02:23:28,codecarbon,4552b67d-99c2-406c-9228-fb9b7db6c238,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,14583.18219661247,0.2174535746209526,1.4911256794930872e-05,42.5,110.64112064398002,47.10247850418091,0.1719119908472117,0.8874324396676059,0.19002974139528025,1.2493741719100981,Spain,ESP,andalusia,,,Linux-6.8.0-59-generic-x86_64-with-glibc2.35,3.10.12,2.8.2,32,13th Gen Intel(R) Core(TM) i9-13900K,1,1 x NVIDIA GeForce RTX 4090,-5.9452,37.4857,125.60660934448242,machine,N,1.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4fa96cf62e6b29c7919bd3e778e6fdad65db1284e593959d3c19d3114c6e1472
 size 3859244480

 version https://git-lfs.github.com/spec/v1
+oid sha256:ee54426c81089ea308e4e7b6cf924f62d94c49094932ec27f3cb53d9231639b7
 size 3859244480

runs/Jul29_22-12-51_b16dc27cf29a/events.out.tfevents.1753827625.b16dc27cf29a.19645.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b79010a8faabc2c6d1f1b0bab1f9842d0f29d89c7b636f25a897d9726104bd4f
-size 46120

 version https://git-lfs.github.com/spec/v1
+oid sha256:19f63f4c1bdcb5a6bc3de98668d1e73713ba4a5b6ce761e466a2b591135f5b94
+size 47449