Leonoravesterbacka commited on
Commit
72bcba4
·
verified ·
1 Parent(s): f56713f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +46 -11
README.md CHANGED
@@ -217,40 +217,75 @@ Stage 1 employed low threshold values (0 to 0.30 BLEU depending on dataset), whe
217
  | NST | 250 | 250 |
218
  | **Total** | **56,514** | **8,533** |
219
 
220
- The default when loading our models through Hugging Face is **Stage 2**. We have however also uploaded continued pretraining checkpoints and tagged them. You can load these other checkpoints by specifying the `revision` in `.from_pretrained()`. The pretrained checkpoints tag can for example be found here: [`pretrained-checkpoint`](https://huggingface.co/KBLab/kb-whisper-large/tree/pretrained-checkpoint). The Stage 2 default model tag is named `standard`. We supply a different stage 2 checkpoint -- with a more condensed style of transcribing -- under the name `subtitle`.
221
 
222
  ### Evaluation
223
 
224
 
225
- #### WER
226
  | Model size | | FLEURS | CommonVoice | NST |
227
  |------------|---------|--------|-------------|------|
228
  | [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **13.2** | **12.9** | **11.2** |
229
  | | OpenAI | 59.2 | 67.8 | 85.2 |
230
  | [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **9.1** | **8.7** | **7.8** |
231
  | | OpenAI | 39.6 | 52.1 | 53.4 |
232
- | [small](https://huggingface.co/KBLab/kb-whisper-small) | **KBLab** | **7.3** | **6.4** | **6.6** |
233
  | | OpenAI | 20.6 | 26.4 | 26.4 |
234
- | [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **KBLab** | **6.6** | **5.4** | **5.8** |
235
  | | OpenAI | 12.1 | 15.8 | 17.1 |
236
- | [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **KBLab** | **5.4** | **4.1** | **5.2** |
237
  | | OpenAI | 7.8 | 9.5 | 11.3 |
238
 
 
239
 
240
- #### BLEU Score
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
241
  | Model size | | FLEURS | CommonVoice | NST |
242
  |------------|---------|--------|-------------|------|
243
- | tiny | KBLab | **76.6** | **73.7** | **74.3** |
244
  | | OpenAI | 26.9 | 21.1 | 24.0 |
245
- | base | KBLab | **83.2** | **79.9** | **78.3** |
246
  | | OpenAI | 41.1 | 32.5 | 36.9 |
247
- | small | KBLab | **86.6** | **83.5** | **79.6** |
248
  | | OpenAI | 64.0 | 56.5 | 58.2 |
249
- | medium | KBLab | **87.6** | **85.0** | **80.2** |
250
  | | OpenAI | 77.1 | 70.1 | 68.9 |
251
- | large-v3 | KBLab | **89.8** | **87.2** | **81.1** |
252
  | | OpenAI | 84.9 | 79.1 | 75.1 |
253
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
254
 
255
  ### Acknowledgements
256
 
 
217
  | NST | 250 | 250 |
218
  | **Total** | **56,514** | **8,533** |
219
 
220
+ The default when loading our models through Hugging Face is **Stage 2**. We have however also uploaded continued pretraining checkpoints and tagged them. You can load these other checkpoints by specifying the `revision` in `.from_pretrained()`. The pretrained checkpoints tag can for example be found here: [`pretrained-checkpoint`](https://huggingface.co/KBLab/kb-whisper-large/tree/pretrained-checkpoint). The Stage 2 default model tag is named `standard`. We supply two different stage 2 checkpoints -- one with a more condensed style of transcribing -- under the name `subtitle`, and one more verbose called `strict`.
221
 
222
  ### Evaluation
223
 
224
 
225
+ #### WER compared to OpenAI
226
  | Model size | | FLEURS | CommonVoice | NST |
227
  |------------|---------|--------|-------------|------|
228
  | [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **13.2** | **12.9** | **11.2** |
229
  | | OpenAI | 59.2 | 67.8 | 85.2 |
230
  | [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **9.1** | **8.7** | **7.8** |
231
  | | OpenAI | 39.6 | 52.1 | 53.4 |
232
+ | [small](https://huggingface.co/KBLab/kb-whisper-small) | **KBLab** | **7.3** | **6.4** | **6.6** |
233
  | | OpenAI | 20.6 | 26.4 | 26.4 |
234
+ | [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **KBLab** | **6.6** | **5.4** | **5.8** |
235
  | | OpenAI | 12.1 | 15.8 | 17.1 |
236
+ | [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **KBLab** | **5.4** | **4.1** | **5.2** |
237
  | | OpenAI | 7.8 | 9.5 | 11.3 |
238
 
239
+ #### WER for different KBLab stage2 versions
240
 
241
+ | Model size | | FLEURS | CommonVoice | NST |
242
+ |------------|---------|--------|-------------|------|
243
+ | [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **standard** | **13.2** | **12.9** | **11.2** |
244
+ | | strict | 14.1 | 13.4 | 11.0 |
245
+ | | subtitle | 13.3 | 12.9 | 11.4 |
246
+ | [base](https://huggingface.co/KBLab/kb-whisper-base) | **standard** | **9.1** | **8.7** | **7.8** |
247
+ | | strict | 10.4 | 9.6 | 8.4 |
248
+ | | subtitle | 9.1 | 8.7 | 7.9 |
249
+ | [small](https://huggingface.co/KBLab/kb-whisper-small) | **standard** | **7.3** | **6.4** | **6.6** |
250
+ | | strict | 8.2 | 7.0 | 6.7 |
251
+ | | subtitle | 7.3 | 6.4 | 6.6 |
252
+ | [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **standard** | **6.6** | **5.4** | **5.8** |
253
+ | | strict | 6.8 | 5.4 | 6.0 |
254
+ | [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **standard** | **5.4** | **4.1** | **5.2** |
255
+ | | strict | 5.3 | 4.0 | 5.1 |
256
+
257
+
258
+ #### BLEU Score compared to OpenAI
259
  | Model size | | FLEURS | CommonVoice | NST |
260
  |------------|---------|--------|-------------|------|
261
+ | [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **76.6** | **73.7** | **74.3** |
262
  | | OpenAI | 26.9 | 21.1 | 24.0 |
263
+ | [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **83.2** | **79.9** | **78.3** |
264
  | | OpenAI | 41.1 | 32.5 | 36.9 |
265
+ | [small](https://huggingface.co/KBLab/kb-whisper-small) | **KBLab** | **86.6** | **83.5** | **79.6** |
266
  | | OpenAI | 64.0 | 56.5 | 58.2 |
267
+ | [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **KBLab** | **87.6** | **85.0** | **80.2** |
268
  | | OpenAI | 77.1 | 70.1 | 68.9 |
269
+ | [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **KBLab** | **89.8** | **87.2** | **81.1** |
270
  | | OpenAI | 84.9 | 79.1 | 75.1 |
271
 
272
+ #### BLEU Score compared to OpenAI
273
+ | Model size | | FLEURS | CommonVoice | NST |
274
+ |------------|---------|--------|-------------|------|
275
+ | [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **standard** | **76.6** | **73.7** | **74.3** |
276
+ | | strict | 75.3 | 72.9 | 74.6 |
277
+ | | subtitle | 76.6 | 73.7 | 74.1 |
278
+ | [base](https://huggingface.co/KBLab/kb-whisper-base) | **standard** | **83.2** | **79.9** | **78.3** |
279
+ | | strict | 81.0 | 78.4 | 77.5 |
280
+ | | subtitle | 83.2 | 79.8 | 78.2 |
281
+ | [small](https://huggingface.co/KBLab/kb-whisper-small) | **standard** | **86.6** | **83.5** | **79.6** |
282
+ | | strict | 84.9 | 82.4 | 79.3 |
283
+ | | subtitle | 86.6 | 83.5 | 79.6 |
284
+ | [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **standard** | **87.6** | **85.0** | **80.2** |
285
+ | | strict | 87.3 | 84.9 | 80.1 |
286
+ | [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **standard** | **89.8** | **87.2** | **81.1** |
287
+ | | strict | 90.0 | 87.4 | 81.2 |
288
+
289
 
290
  ### Acknowledgements
291