leonardvorbeck
/

wav2vec2-large-robust-LS960

Automatic Speech Recognition

Model card Files Files and versions

leonardvorbeck commited on Aug 24, 2021

Commit

3c9ddc6

·

1 Parent(s): 8a277b8

Update README.md

Files changed (1) hide show

README.md +5 -2

README.md CHANGED Viewed

@@ -7,7 +7,9 @@ datasets:
 - fisher
 tags:
 - speech
-- asr
 - wav2vec2
 license: apache-2.0
 ---
@@ -23,7 +25,8 @@ Speech datasets from multiple domains were used to pretrain the model:
 - [Switchboard](https://catalog.ldc.upenn.edu/LDC97S62): telephone speech corpus; noisy telephone data
 - [Fisher](https://catalog.ldc.upenn.edu/LDC2004T19): conversational telephone speech; noisy telephone data
-When using the model make sure that your speech input is also sampled at 16Khz. Note that this model should be fine-tuned on a downstream task, like Automatic Speech Recognition. Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more information.
 [Paper Robust Wav2Vec2](https://arxiv.org/abs/2104.01027)

 - fisher
 tags:
 - speech
+- automatic-speech-recognition
+- CTC
+- Attention
 - wav2vec2
 license: apache-2.0
 ---
 - [Switchboard](https://catalog.ldc.upenn.edu/LDC97S62): telephone speech corpus; noisy telephone data
 - [Fisher](https://catalog.ldc.upenn.edu/LDC2004T19): conversational telephone speech; noisy telephone data
+When using the model make sure that your speech input is also sampled at 16Khz.
+Check out [this blog](https://huggingface.co/blog/fine-tune-wav2vec2-english) for more information.
 [Paper Robust Wav2Vec2](https://arxiv.org/abs/2104.01027)