update readme and add plots

Signed-off-by: monica-sekoyan <[email protected]>

Files changed (5)

.gitattributes +1 -0
README.md +9 -9
plots/asr.png +3 -0
plots/en_x.png +3 -0
plots/x_en.png +3 -0

.gitattributes CHANGED

@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 canary-1b-v2.nemo filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text
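
The added `*.png` rule is what routes the three new plot files through Git LFS. As a rough illustration only, the Python sketch below approximates how slash-free `.gitattributes` patterns match on a file's base name at any directory depth. The helper `is_lfs_tracked` and the `LFS_PATTERNS` list are hypothetical names introduced here, and `fnmatchcase` is a stand-in for Git's own matcher, not a reimplementation of it.

```python
from fnmatch import fnmatchcase
from posixpath import basename

# Slash-free patterns visible in this hunk (assumption: other rules in the
# file, such as saved_model/**/*, are ignored for this sketch).
LFS_PATTERNS = ["*.zst", "*tfevents*", "canary-1b-v2.nemo", "*.png"]


def is_lfs_tracked(path, patterns=LFS_PATTERNS):
    """Approximate check: does the file's base name match any LFS pattern?"""
    name = basename(path)
    return any(fnmatchcase(name, pattern) for pattern in patterns)


if __name__ == "__main__":
    for path in ["plots/asr.png", "plots/en_x.png", "plots/x_en.png", "README.md"]:
        print(path, is_lfs_tracked(path))
```

Under these assumptions the three files in `plots/` match the new rule, while `README.md` stays a regular Git blob.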

README.md CHANGED

@@ -42,7 +42,7 @@ We will soon release a comprehensive **Canary-1b-v2 technical report** detailing
 ### Automatic Speech Recognition (ASR)
-![ASR WER Comparison](plots/asr_performance.png)
+![ASR WER Comparison](plots/asr.png)
 *Figure 1: ASR WER comparison across different models. This does not include Punctuation and Capitalisation errors.*
@@ -52,14 +52,14 @@ We will soon release a comprehensive **Canary-1b-v2 technical report** detailing
 #### X → English
-![AST X-En Comparison](plots/x_en_performance.png)
+![AST X-En Comparison](plots/x_en.png)
 *Figure 2: AST X → En COMET scores comparison across different models*
 #### English → X
-![AST En-X Comparison](plots/en_x_performance.png)
+![AST En-X Comparison](plots/en_x.png)
 *Figure 3: AST En → X COMET scores comparison across different models*
@@ -283,7 +283,7 @@ This section reports the evaluation results of the ``Canary-1b-v2`` model across
 | **WER ↓**          | Fleurs-25 Langs       | CoVoST-13 Langs      | MLS - 6 Langs      |
 | ---------------    | --------------------  | -------------------- | ------------------ |
-| **`Canary-1b-v2`** | 8.42%                 | 7.61%                | 7.29%              |
+| **`Canary-1b-v2`** | 8.40%                 | 8.85%                | 7.27%              |
 **Note:** Presented WERs do not include Punctuation and Capitalization errors.
@@ -326,9 +326,9 @@ More details on evaluation can be found at [HuggingFace ASR Leaderboard](https:/
 Performance across different Signal-to-Noise Ratios (SNR) using MUSAN music and noise samples \[16] on the [LibriSpeech Clean test set](https://www.openslr.org/12).
 **Metric**: Word Error Rate (**WER**)
-| **SNR (dB)**    | 100   | 50    | 25    | 15    | 10    | 5     | 0     | -5        |
-| --------------- | ----- | ----- | ----- | ----- | ----- | ----- | ----- | ------    |
-| **`Canary-1b-v2`** | 2.19% | 2.16% | 2.01% | 2.17% | 2.29% | 2.80% | 5.08% | 19.38% |
+| **SNR (dB)**       | 100      | 10    | 5     | 0     | -5     |
+| ---------------    | -----    | ----- | ----- | ----- | -----  |
+| **`Canary-1b-v2`** | 2.18% | 2.29% | 2.80% | 5.08% | 19.38% |
 ### Hallucination Robustness
@@ -346,8 +346,8 @@ Number of characters per minute on [MUSAN](https://www.openslr.org/17) \[16] 48
 | **Dataset**             | **WER ↓** |
 | ----------------------- | --------- |
-| Earnings-22             | TBD       |
-| This American Life      | TBD       |
+| Earnings-22             | 13.51%    |
+| This American Life      | 8.65%     |
 **Note:** Presented WERs do not include Punctuation and Capitalization errors.

plots/asr.png ADDED

Git LFS Details

SHA256: 9ed105e2323d620f77c90f8084494b2db2f7916dc99975b0f13c0d1523b985c5
Pointer size: 131 Bytes
Size of remote file: 136 kB

plots/en_x.png ADDED

Git LFS Details

SHA256: 49967add7a65e021eec831cba15f45904562f694535d155f20a31c46d4a420da
Pointer size: 131 Bytes
Size of remote file: 122 kB

plots/x_en.png ADDED

Git LFS Details

SHA256: a444a795dca8fc049a4d4702ca94820cf576c732466d7ffcf050335f6b7c9db9
Pointer size: 131 Bytes
Size of remote file: 121 kB
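
The "Pointer size: 131 Bytes" figure reported for each plot is consistent with the Git LFS pointer file layout (a `version` line, an `oid sha256:` line, and a `size` line). The Python sketch below rebuilds such a pointer for `plots/asr.png` to show where the 131 bytes could come from. The exact byte size of the remote file is not given in the listing above (only "136 kB"), so the value `136000` is a placeholder assumption, and `lfs_pointer` and `sha256_hex` are hypothetical helper names.

```python
import hashlib


def lfs_pointer(oid_hex: str, size: int) -> str:
    """Build the text of a Git LFS pointer file for a given object."""
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid_hex}\n"
        f"size {size}\n"
    )


def sha256_hex(data: bytes) -> str:
    """SHA-256 digest of the real file contents, as used for the LFS oid."""
    return hashlib.sha256(data).hexdigest()


# SHA256 copied from the listing for plots/asr.png.
ASR_OID = "9ed105e2323d620f77c90f8084494b2db2f7916dc99975b0f13c0d1523b985c5"
# Assumption: placeholder size; the listing only says "136 kB".
ASR_SIZE_PLACEHOLDER = 136000

pointer = lfs_pointer(ASR_OID, ASR_SIZE_PLACEHOLDER)

if __name__ == "__main__":
    print(pointer, end="")
    print("pointer bytes:", len(pointer.encode("utf-8")))
```

Any six-digit size gives the same pointer length, which would explain why all three plots (136 kB, 122 kB, 121 kB) report an identical pointer size. With a local copy of the image, `sha256_hex(open("plots/asr.png", "rb").read())` could be compared against the listed SHA256 to check the download.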