Commit 6200b64 by cjli · 1 parent: 5920fbc

upload model

Files changed (23)
  1. README.md +73 -0
  2. data/iba_token_list/bpe_unigram200/bpe.model +3 -0
  3. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/28epoch.pth +3 -0
  4. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/RESULTS.md +49 -0
  5. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/config.yaml +431 -0
  6. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/acc.png +0 -0
  7. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/backward_time.png +0 -0
  8. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/cer.png +0 -0
  9. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/cer_ctc.png +0 -0
  10. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/clip.png +0 -0
  11. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/forward_time.png +0 -0
  12. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/gpu_max_cached_mem_GB.png +0 -0
  13. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/grad_norm.png +0 -0
  14. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/iter_time.png +0 -0
  15. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss.png +0 -0
  16. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_att.png +0 -0
  17. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_ctc.png +0 -0
  18. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_scale.png +0 -0
  19. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/optim0_lr0.png +0 -0
  20. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/optim_step_time.png +0 -0
  21. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/train_time.png +0 -0
  22. exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/wer.png +0 -0
  23. meta.yaml +8 -0
README.md ADDED
@@ -0,0 +1,73 @@
+ ---
+ tags:
+ - espnet
+ - audio
+ - automatic-speech-recognition
+ language: iba
+ datasets:
+ - Iban
+ license: cc-by-4.0
+ ---
+
+ ## ESPnet2 ASR model
+
+ ### `cjli/cjli_wavlm_transformer`
+
+ This model was trained using the iban recipe in [espnet](https://github.com/espnet/espnet/).
+
+ ### Run recipe
+
+ Follow the [ESPnet installation instructions](https://espnet.github.io/espnet/installation.html) first, then:
+
+ ```bash
+ cd espnet/egs2/iban/asr1
+ ./run.sh --skip_data_prep false --skip_train true --download_model cjli/iban_wavlm_transformer
+ ```
+
+ ### Results
+
+ **WER**
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|2226|71.2|23.8|5.1|2.3|31.2|94.2|
+
+ **CER**
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|13527|91.8|2.7|5.4|2.2|10.4|94.2|
+
+ **TER**
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|5758|81.2|10.9|7.8|2.7|21.4|94.2|
+
+
+ ### Citing ESPnet
+
+ ```bibtex
+ @inproceedings{watanabe2018espnet,
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+ title={{ESPnet}: End-to-End Speech Processing Toolkit},
+ year={2018},
+ booktitle={Proceedings of Interspeech},
+ pages={2207--2211},
+ doi={10.21437/Interspeech.2018-1456},
+ url={http://dx.doi.org/10.21437/Interspeech.2018-1456}
+ }
+ ```
+
+ or arXiv:
+
+ ```bibtex
+ @misc{watanabe2018espnet,
+ title={ESPnet: End-to-End Speech Processing Toolkit},
+ author={Shinji Watanabe and Takaaki Hori and Shigeki Karita and Tomoki Hayashi and Jiro Nishitoba and Yuya Unno and Nelson Yalta and Jahn Heymann and Matthew Wiesner and Nanxin Chen and Adithya Renduchintala and Tsubasa Ochiai},
+ year={2018},
+ eprint={1804.00015},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL}
+ }
+ ```
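The README in this commit only documents the recipe route (`run.sh --download_model`). For a quick check outside the recipe, a minimal Python sketch might look like the following. It assumes that `espnet`, `espnet_model_zoo`, `s3prl` and `soundfile` are installed, and that the model resolves under the tag used in the `run.sh` command (`cjli/iban_wavlm_transformer`); the README heading shows a different name, so the tag is an assumption. `transcribe` and `wav_path` are illustrative names, not part of the upload.

```python
def transcribe(wav_path, model_tag="cjli/iban_wavlm_transformer"):
    """Transcribe one 16 kHz mono WAV file with a pretrained ESPnet2 ASR model.

    model_tag is an assumption taken from the run.sh command in the README.
    """
    # Imported lazily so that defining the function does not require ESPnet.
    import soundfile as sf
    from espnet2.bin.asr_inference import Speech2Text

    # from_pretrained relies on the espnet_model_zoo package to fetch the model.
    speech2text = Speech2Text.from_pretrained(model_tag)

    speech, rate = sf.read(wav_path)
    # The training config in this commit sets the frontend to fs: 16k.
    assert rate == 16000, "expected 16 kHz audio"

    # Each n-best entry is (text, tokens, token_ids, hypothesis); keep the top text.
    text, *_ = speech2text(speech)[0]
    return text
```

Calling `transcribe("utt.wav")` would download the checkpoint on first use; the WavLM Large upstream is fetched separately by s3prl.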
data/iba_token_list/bpe_unigram200/bpe.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:007bb1faf4ba4576137adc53b3480c937b9f3a713873417288e8b667735ba572
+ size 240535
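The three added lines are a Git LFS pointer, not the SentencePiece model itself: the repository stores only the spec version, the SHA-256 of the real file, and its size in bytes. As a rough illustration, the pointer can be parsed like this (`parse_lfs_pointer` is a hypothetical helper written for this note, not part of Git LFS):

```python
# Pointer text copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:007bb1faf4ba4576137adc53b3480c937b9f3a713873417288e8b667735ba572
size 240535
"""


def parse_lfs_pointer(text):
    """Split a Git LFS pointer into its version, hash algorithm, digest and size."""
    # Each non-empty line is "<key> <value>".
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line)
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


info = parse_lfs_pointer(POINTER)
print(info["algo"], info["size"])
```

After downloading the real `bpe.model`, comparing its SHA-256 and byte length against `digest` and `size` is one way to confirm the file arrived intact.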
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/28epoch.pth ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0caf7589588e72d28ad9d07b00b6298751c7faa4660b0e23c5796896f84c6363
+ size 1384762787
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/RESULTS.md ADDED
@@ -0,0 +1,49 @@
+ [INFO] /jet/home/cli22/ocean/my-espnet/egs2/iban/asr1/../../../tools/activate_python.sh is not present
+ <!-- Generated by scripts/utils/show_asr_result.sh -->
+ # RESULTS
+ ## Environments
+ - date: `Mon May 12 12:37:16 EDT 2025`
+ - python version: `3.10.13 | packaged by conda-forge | (main, Dec 23 2023, 15:36:39) [GCC 12.3.0]`
+ - espnet version: `espnet 202412`
+ - pytorch version: `pytorch 2.0.1`
+ - Git hash: `9e12b0c877d28fba8ae1ce71abf6ed91c05d9238`
+ - Commit date: `Tue May 6 07:28:58 2025 -0400`
+
+ ## exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp
+ ### WER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|2226|71.2|23.8|5.1|2.3|31.2|94.2|
+
+ ### CER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|13527|91.8|2.7|5.4|2.2|10.4|94.2|
+
+ ### TER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |decode_asr_asr_model_valid.acc.best/test|104|5758|81.2|10.9|7.8|2.7|21.4|94.2|
+
+ ## exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/decode_asr_asr_model_valid.acc.best
+ ### WER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |org/dev|473|11006|80.0|16.4|3.5|2.4|22.3|92.0|
+
+ ### CER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |org/dev|473|67025|94.9|1.9|3.3|1.7|6.9|92.0|
+
+ ### TER
+
+ |dataset|Snt|Wrd|Corr|Sub|Del|Ins|Err|S.Err|
+ |---|---|---|---|---|---|---|---|---|
+ |org/dev|473|27176|87.7|7.5|4.8|1.9|14.2|92.0|
+
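In these sclite-style tables, `Err` is the sum of the substitution, deletion and insertion rates, each given as a percentage of the reference units (words, characters or BPE tokens). A small sketch re-adding the columns from RESULTS.md shows that they agree with the reported `Err` to within one rounding step; `error_rate` is an illustrative helper, and the numbers are copied from the tables.

```python
# (Sub, Del, Ins, reported Err) copied from the RESULTS.md tables in this commit.
ROWS = {
    "test WER": (23.8, 5.1, 2.3, 31.2),
    "test CER": (2.7, 5.4, 2.2, 10.4),
    "test TER": (10.9, 7.8, 2.7, 21.4),
    "dev WER": (16.4, 3.5, 2.4, 22.3),
    "dev CER": (1.9, 3.3, 1.7, 6.9),
    "dev TER": (7.5, 4.8, 1.9, 14.2),
}


def error_rate(sub, dele, ins):
    """Err = Sub + Del + Ins, rounded to one decimal like the tables."""
    return round(sub + dele + ins, 1)


for name, (sub, dele, ins, reported) in ROWS.items():
    total = error_rate(sub, dele, ins)
    gap = abs(total - reported)
    print(f"{name}: Sub+Del+Ins = {total:.1f}, reported {reported:.1f}, gap {gap:.1f}")
```

The only row that differs is test CER, where the rounded columns add up to 10.3 against a reported 10.4; the columns are rounded individually, so a 0.1 gap is expected.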
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/config.yaml ADDED
@@ -0,0 +1,431 @@
+ config: conf/tuning/train_asr_wavlm_conformer.yaml
+ print_config: false
+ log_level: INFO
+ drop_last_iter: false
+ dry_run: false
+ iterator_type: sequence
+ valid_iterator_type: null
+ output_dir: exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp
+ ngpu: 1
+ seed: 2025
+ num_workers: 1
+ num_att_plot: 3
+ dist_backend: nccl
+ dist_init_method: env://
+ dist_world_size: null
+ dist_rank: null
+ local_rank: 0
+ dist_master_addr: null
+ dist_master_port: null
+ dist_launcher: null
+ multiprocessing_distributed: false
+ unused_parameters: false
+ sharded_ddp: false
+ use_deepspeed: false
+ deepspeed_config: null
+ gradient_as_bucket_view: true
+ ddp_comm_hook: null
+ cudnn_enabled: true
+ cudnn_benchmark: false
+ cudnn_deterministic: true
+ use_tf32: false
+ collect_stats: false
+ write_collected_feats: false
+ max_epoch: 30
+ patience: null
+ val_scheduler_criterion:
+ - valid
+ - loss
+ early_stopping_criterion:
+ - valid
+ - loss
+ - min
+ best_model_criterion:
+ -   - valid
+     - acc
+     - max
+ keep_nbest_models: 5
+ nbest_averaging_interval: 0
+ grad_clip: 5.0
+ grad_clip_type: 2.0
+ grad_noise: false
+ accum_grad: 3
+ no_forward_run: false
+ resume: true
+ train_dtype: float32
+ use_amp: false
+ log_interval: null
+ use_matplotlib: true
+ use_tensorboard: true
+ create_graph_in_tensorboard: false
+ use_wandb: false
+ wandb_project: null
+ wandb_id: null
+ wandb_entity: null
+ wandb_name: null
+ wandb_model_log_interval: -1
+ detect_anomaly: false
+ use_adapter: false
+ adapter: lora
+ save_strategy: all
+ adapter_conf: {}
+ pretrain_path: null
+ init_param: []
+ ignore_init_mismatch: false
+ freeze_param:
+ - frontend.upstream
+ num_iters_per_epoch: null
+ batch_size: 20
+ valid_batch_size: null
+ batch_bins: 1600000
+ valid_batch_bins: null
+ category_sample_size: 10
+ train_shape_file:
+ - exp/asr_stats_raw_iba_bpe200_sp/train/speech_shape
+ - exp/asr_stats_raw_iba_bpe200_sp/train/text_shape.bpe
+ valid_shape_file:
+ - exp/asr_stats_raw_iba_bpe200_sp/valid/speech_shape
+ - exp/asr_stats_raw_iba_bpe200_sp/valid/text_shape.bpe
+ batch_type: numel
+ valid_batch_type: null
+ fold_length:
+ - 80000
+ - 150
+ sort_in_batch: descending
+ shuffle_within_batch: false
+ sort_batch: descending
+ multiple_iterator: false
+ chunk_length: 500
+ chunk_shift_ratio: 0.5
+ num_cache_chunks: 1024
+ chunk_excluded_key_prefixes: []
+ chunk_default_fs: null
+ chunk_max_abs_length: null
+ chunk_discard_short_samples: true
+ train_data_path_and_name_and_type:
+ -   - dump/raw/train_sp/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/train_sp/text
+     - text
+     - text
+ valid_data_path_and_name_and_type:
+ -   - dump/raw/dev/wav.scp
+     - speech
+     - sound
+ -   - dump/raw/dev/text
+     - text
+     - text
+ multi_task_dataset: false
+ allow_variable_data_keys: false
+ max_cache_size: 0.0
+ max_cache_fd: 32
+ allow_multi_rates: false
+ valid_max_cache_size: null
+ exclude_weight_decay: false
+ exclude_weight_decay_conf: {}
+ optim: adam
+ optim_conf:
+     lr: 0.005
+ scheduler: warmuplr
+ scheduler_conf:
+     warmup_steps: 15000
+ token_list:
134
+ - <blank>
135
+ - <unk>
136
+ - ▁
137
+ - i
138
+ - u
139
+ - ka
140
+ - a
141
+ - k
142
+ - t
143
+ - e
144
+ - m
145
+ - l
146
+ - h
147
+ - p
148
+ - s
149
+ - ▁ke
150
+ - b
151
+ - r
152
+ - ▁di
153
+ - o
154
+ - an
155
+ - ▁tu
156
+ - ng
157
+ - ▁s
158
+ - ▁ti
159
+ - ▁be
160
+ - la
161
+ - ▁p
162
+ - in
163
+ - ang
164
+ - ta
165
+ - ▁ba
166
+ - ▁enggau
167
+ - ▁nya
168
+ - ▁n
169
+ - g
170
+ - er
171
+ - ja
172
+ - ▁se
173
+ - ri
174
+ - ku
175
+ - n
176
+ - ▁d
177
+ - ▁ari
178
+ - ra
179
+ - ▁m
180
+ - ▁menua
181
+ - y
182
+ - ar
183
+ - d
184
+ - ▁iya
185
+ - ▁madah
186
+ - di
187
+ - ah
188
+ - en
189
+ - ▁deka
190
+ - ▁a
191
+ - ti
192
+ - j
193
+ - ▁b
194
+ - ▁pengawa
195
+ - it
196
+ - c
197
+ - ▁pe
198
+ - ai
199
+ - ga
200
+ - lan
201
+ - ▁ma
202
+ - ung
203
+ - ▁peng
204
+ - alam
205
+ - un
206
+ - am
207
+ - ▁udah
208
+ - sa
209
+ - ▁k
210
+ - ▁menteri
211
+ - ▁orang
212
+ - da
213
+ - ur
214
+ - ▁sida
215
+ - ▁ngagai
216
+ - si
217
+ - al
218
+ - ▁datu
219
+ - ▁ng
220
+ - w
221
+ - ▁puluh
222
+ - ▁pen
223
+ - ▁berita
224
+ - ▁dua
225
+ - ma
226
+ - ▁ka
227
+ - ▁enda
228
+ - ▁nga
229
+ - ▁pem
230
+ - re
231
+ - at
232
+ - ▁na
233
+ - ▁malaysia
234
+ - ▁nge
235
+ - ntu
236
+ - ▁c
237
+ - du
238
+ - ▁l
239
+ - ▁agi
240
+ - ▁kena
241
+ - 'on'
242
+ - ▁perintah
243
+ - ▁me
244
+ - nya
245
+ - mba
246
+ - ▁tau
247
+ - f
248
+ - ▁opis
249
+ - ▁te
250
+ - ▁sarawak
251
+ - ▁diatu
252
+ - ▁nyadi
253
+ - ratus
254
+ - ▁mayuh
255
+ - ad
256
+ - ▁besai
257
+ - ▁tiga
258
+ - ▁lalu
259
+ - aya
260
+ - ▁seri
261
+ - ▁g
262
+ - ▁jaku
263
+ - ▁bis
264
+ - ▁ulih
265
+ - ▁satu
266
+ - ▁bala
267
+ - ▁taun
268
+ - ▁lima
269
+ - ▁mega
270
+ - ▁bansa
271
+ - ▁pengerami
272
+ - ▁raban
273
+ - ▁ribu
274
+ - ▁kuching
275
+ - ▁mansut
276
+ - ▁empat
277
+ - v
278
+ - ereta
279
+ - ▁ngena
280
+ - ▁ju
281
+ - ▁gawa
282
+ - ▁enam
283
+ - manah
284
+ - ▁hari
285
+ - ▁penya
286
+ - ▁patut
287
+ - ▁tadi
288
+ - ▁dikena
289
+ - ▁pekara
290
+ - ▁belas
291
+ - ▁baru
292
+ - asal
293
+ - ▁lebuh
294
+ - ▁lapan
295
+ - ▁nembiak
296
+ - ▁tujuh
297
+ - ▁semua
298
+ - ▁ringgit
299
+ - ▁endur
300
+ - aji
301
+ - ▁atur
302
+ - ▁pemansang
303
+ - ▁mensia
304
+ - ▁penemu
305
+ - ▁gerempung
306
+ - guna
307
+ - ereja
308
+ - ▁projek
309
+ - ▁ngambi
310
+ - ▁rakyat
311
+ - ▁rayat
312
+ - ▁kemari
313
+ - ▁bengkah
314
+ - ▁progr
315
+ - ▁abdul
316
+ - ▁dagang
317
+ - ▁pelajar
318
+ - erima
319
+ - ▁waifm
320
+ - ▁pengara
321
+ - ▁nitih
322
+ - ▁nasional
323
+ - ▁bejalai
324
+ - ▁laban
325
+ - ▁bendar
326
+ - ▁dayak
327
+ - ▁sembi
328
+ - z
329
+ - '-'
330
+ - x
331
+ - '2'
332
+ - q
333
+ - <sos/eos>
334
+ init: null
335
+ input_size: null
336
+ ctc_conf:
337
+ dropout_rate: 0.0
338
+ ctc_type: builtin
339
+ reduce: true
340
+ ignore_nan_grad: null
341
+ zero_infinity: true
342
+ brctc_risk_strategy: exp
343
+ brctc_group_strategy: end
344
+ brctc_risk_factor: 0.0
345
+ joint_net_conf: null
346
+ use_preprocessor: true
347
+ use_lang_prompt: false
348
+ use_nlp_prompt: false
349
+ token_type: bpe
350
+ bpemodel: data/iba_token_list/bpe_unigram200/bpe.model
351
+ non_linguistic_symbols: null
352
+ cleaner: null
353
+ g2p: null
354
+ speech_volume_normalize: null
355
+ rir_scp: null
356
+ rir_apply_prob: 1.0
357
+ noise_scp: null
358
+ noise_apply_prob: 1.0
359
+ noise_db_range: '13_15'
360
+ short_noise_thres: 0.5
361
+ aux_ctc_tasks: []
362
+ frontend: s3prl
363
+ frontend_conf:
364
+ frontend_conf:
365
+ upstream: wavlm_large
366
+ download_dir: ./hub
367
+ multilayer_feature: true
368
+ fs: 16k
369
+ specaug: specaug
370
+ specaug_conf:
371
+ apply_time_warp: true
372
+ time_warp_window: 5
373
+ time_warp_mode: bicubic
374
+ apply_freq_mask: true
375
+ freq_mask_width_range:
376
+ - 0
377
+ - 30
378
+ num_freq_mask: 2
379
+ apply_time_mask: true
380
+ time_mask_width_range:
381
+ - 0
382
+ - 40
383
+ num_time_mask: 2
384
+ normalize: utterance_mvn
385
+ normalize_conf: {}
386
+ model: espnet
387
+ model_conf:
388
+ ctc_weight: 0.3
389
+ lsm_weight: 0.1
390
+ length_normalized_loss: false
391
+ extract_feats_in_collect_stats: false
392
+ preencoder: linear
393
+ preencoder_conf:
394
+ input_size: 1024
395
+ output_size: 80
396
+ encoder: conformer
397
+ encoder_conf:
398
+ output_size: 256
399
+ attention_heads: 4
400
+ linear_units: 1024
401
+ num_blocks: 12
402
+ dropout_rate: 0.1
403
+ positional_dropout_rate: 0.1
404
+ attention_dropout_rate: 0.1
405
+ input_layer: conv2d
406
+ normalize_before: true
407
+ macaron_style: true
408
+ rel_pos_type: latest
409
+ pos_enc_layer_type: rel_pos
410
+ selfattention_layer_type: rel_selfattn
411
+ activation_type: swish
412
+ use_cnn_module: true
413
+ cnn_module_kernel: 31
414
+ postencoder: null
415
+ postencoder_conf: {}
416
+ decoder: transformer
417
+ decoder_conf:
418
+ attention_heads: 4
419
+ linear_units: 2048
420
+ num_blocks: 6
421
+ dropout_rate: 0.1
422
+ positional_dropout_rate: 0.1
423
+ self_attention_dropout_rate: 0.1
424
+ src_attention_dropout_rate: 0.1
425
+ preprocessor: default
426
+ preprocessor_conf: {}
427
+ required:
428
+ - output_dir
429
+ - token_list
430
+ version: '202412'
431
+ distributed: false
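The config above pairs `optim_conf.lr: 0.005` with `scheduler: warmuplr` and `warmup_steps: 15000`. A minimal sketch of the learning-rate curve this implies, assuming the Noam-style formula that ESPnet's `WarmupLR` documents (linear ramp up to the base rate, then inverse-square-root decay), could look like this. `warmup_lr` is an illustrative function, not the ESPnet class.

```python
def warmup_lr(step, base_lr=0.005, warmup_steps=15000):
    """Noam-style warmup: linear ramp to base_lr, then inverse-sqrt decay.

    Defaults are the optim_conf.lr and scheduler_conf.warmup_steps values
    from the config in this commit.
    """
    return base_lr * warmup_steps**0.5 * min(step**-0.5, step * warmup_steps**-1.5)


# Sample the curve before, at, and after the warmup boundary.
for step in (1, 5000, 15000, 60000):
    print(step, f"{warmup_lr(step):.6f}")
```

The rate peaks at exactly `base_lr` when `step == warmup_steps` and halves by four times that many steps. As far as I understand the trainer, `step` counts optimizer updates, so with `accum_grad: 3` each step would span three mini-batches.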
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/acc.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/backward_time.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/cer.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/cer_ctc.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/clip.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/forward_time.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/gpu_max_cached_mem_GB.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/grad_norm.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/iter_time.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_att.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_ctc.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/loss_scale.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/optim0_lr0.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/optim_step_time.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/train_time.png ADDED
exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/images/wer.png ADDED
meta.yaml ADDED
@@ -0,0 +1,8 @@
+ espnet: '202412'
+ files:
+   asr_model_file: exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/28epoch.pth
+ python: 3.10.13 | packaged by conda-forge | (main, Dec 23 2023, 15:36:39) [GCC 12.3.0]
+ timestamp: 1747077768.065848
+ torch: 2.0.1
+ yaml_files:
+   asr_train_config: exp/asr_train_asr_wavlm_conformer_raw_iba_bpe200_sp/config.yaml
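The `timestamp` field in `meta.yaml` is hard to read as a raw number. Assuming it is a POSIX epoch value in seconds (which is how it appears to be written when the model is packed), it can be converted to a calendar time as sketched below; `packed_at` is an illustrative name.

```python
from datetime import datetime, timezone

# Value copied from meta.yaml; assumed to be seconds since the Unix epoch.
packed_at = datetime.fromtimestamp(1747077768.065848, tz=timezone.utc)
print(packed_at.strftime("%Y-%m-%d %H:%M:%S %Z"))
```

Under that assumption the model was packed on 12 May 2025 at about 19:22 UTC, a few hours after the `date` recorded in RESULTS.md (12:37 EDT the same day).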