Update README.md
README.md (CHANGED)
@@ -106,12 +106,15 @@ img {
 
 ## Overview
 
-This model is a fine-tuned version of the [NVIDIA French FastConformer Hybrid Large model](https://huggingface.co/nvidia/stt_fr_fastconformer_hybrid_large_pc).
+This model is a fine-tuned version of the [NVIDIA French FastConformer Hybrid Large model](https://huggingface.co/nvidia/stt_fr_fastconformer_hybrid_large_pc).
+It is a large (115M parameters) hybrid ASR model trained with both **Transducer (default)** and **CTC** losses.
 
 Compared to the base model, this version:
 - Does **not** include punctuation or uppercase letters.
 - Was trained on **9,500+ hours** of diverse, manually transcribed French speech.
 
+The training code is available in the [NeMo ASR training repository](https://github.com/linagora-labs/nemo_asr_training).
+
 ---
 
 ## Performance
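The sentence added above notes that both decoding heads ship in the same checkpoint. As a minimal sketch of what that means in practice with NeMo, assuming the model loads through `ASRModel.from_pretrained` (the model identifier and audio path below are placeholders, not values taken from this README):

```python
# Minimal sketch: load the hybrid FastConformer checkpoint and compare the two
# decoding heads. The model ID below is the base model, used as a placeholder;
# substitute the Hugging Face ID of this fine-tuned checkpoint.
import nemo.collections.asr as nemo_asr

asr_model = nemo_asr.models.ASRModel.from_pretrained(
    "nvidia/stt_fr_fastconformer_hybrid_large_pc"  # placeholder ID
)

audio_path = "example_fr.wav"  # placeholder: any 16 kHz mono WAV file

# Default decoding uses the Transducer (RNN-T) head.
print(asr_model.transcribe([audio_path]))

# Hybrid checkpoints also carry a CTC head; switch to it before transcribing.
asr_model.change_decoding_strategy(decoder_type="ctc")
print(asr_model.transcribe([audio_path]))
```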
@@ -173,6 +176,8 @@ asr_model.transcribe([audio_path])
 
 ## Datasets
 
+The data were transformed, processed, and converted using [NeMo tools from the SSAK repository](https://github.com/linagora-labs/ssak/tree/main/tools/nemo).
+
 The model was trained on over 9,500 hours of French speech, covering:
 - Read and spontaneous speech
 - Conversations and meetings
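The conversion step added above targets NeMo's usual training input: a JSON-lines manifest with one utterance per line. A minimal sketch of writing such an entry, assuming `soundfile` is available for reading durations (paths and transcript are illustrative, and this does not reproduce the SSAK tooling itself):

```python
# Minimal sketch of a NeMo-style JSON-lines manifest entry. Paths and the
# transcript are illustrative; the SSAK tools referenced above automate this.
import json
import soundfile as sf  # assumption: soundfile is installed for duration lookup

audio_file = "audio/session_001.wav"  # placeholder path to a 16 kHz mono WAV

entry = {
    "audio_filepath": audio_file,
    "duration": float(sf.info(audio_file).duration),
    # Transcript follows the model's convention: lowercase, no punctuation.
    "text": "bonjour à tous et bienvenue",
}

with open("train_manifest.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(entry, ensure_ascii=False) + "\n")
```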