GetmanY1/wav2vec2-large-fi-lp-cont-pt

Automatic Speech Recognition

GetmanY1 committed on Sep 13, 2024

Commit 87bb4f9 · verified · 1 Parent(s): 8e2a2ac

Update README.md

Files changed (1)

README.md +9 -10

@@ -16,7 +16,7 @@ The large model pre-trained on 16kHz sampled speech audio with [facebook/wav2vec
 The Finnish Wav2Vec2 Base has the same architecture and uses the same training objective as the English and multilingual one described in [Paper](https://arxiv.org/abs/2006.11477). It is pre-trained on 2600 hours of unlabeled colloquial Finnish speech from [Lahjoita puhetta (Donate Speech)](https://link.springer.com/article/10.1007/s10579-022-09606-3).
-You can read more about the pre-trained model from [this paper](TODO). The training scripts are available on [GitHub](https://github.com/aalto-speech/colloquial-Finnish-wav2vec2)
+You can read more about the pre-trained model from [this paper](https://www.isca-archive.org/interspeech_2024/getman24_interspeech.html). The training scripts are available on [GitHub](https://github.com/aalto-speech/colloquial-Finnish-wav2vec2)
 ## Intended uses & limitations
@@ -37,15 +37,14 @@ The model was pre-trained on the data from the [Lahjoita puhetta (Donate Speech)
 If you use our models or scripts, please cite our article as:
 ```bibtex
-@inproceedings{getman24a_interspeech,
-  author={Yaroslav Getman and Tamas Grosz and Mikko Kurimo},
-  title={{What happens in continued pre-training? Analysis of self-supervised speech
-models with continued pre-training for colloquial Finnish ASR}},
-  year=2024,
-  booktitle={Proc. INTERSPEECH 2024},
-  pages={XX--XX},
-  doi={XXXX},
-  issn={XXXX-XXXX}
+@inproceedings{getman24_interspeech,
+  title     = {What happens in continued pre-training? Analysis of self-supervised speech
+ models with continued pre-training for colloquial Finnish ASR},
+  author    = {Yaroslav Getman and Tamas Grosz and Mikko Kurimo},
+  year      = {2024},
+  booktitle = {Interspeech 2024},
+  pages     = {5043--5047},
+  doi       = {10.21437/Interspeech.2024-476},
 }
 ```