nvidia
/

parakeet-tdt-0.6b-v3

Automatic Speech Recognition

hf-asr-leaderboard

Model card Files Files and versions Community

nithinraok commited on 1 day ago

Commit

bb0964b

·

1 Parent(s): 7938c10

Add streaming inference info

Signed-off-by: nithinraok <[email protected]>

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -927,8 +927,25 @@ output = asr_model.transcribe(['2086-149220-0033.wav'])
 print(output[0].text)
 ```
 ## <span style="color:#466f00;">Software Integration:</span>

 print(output[0].text)
 ```
+#### Streaming with Parakeet models
+To use parakeet models in streaming mode use this [script](https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py) as shown below:
+```bash
+python NeMo/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py \
+    pretrained_name="nvidia/parakeet-tdt-0.6b-v3" \
+    model_path=null \
+    audio_dir="<optional path to folder of audio files>" \
+    dataset_manifest="<optional path to manifest>" \
+    output_filename="<optional output filename>" \
+    right_context_secs=2.0 \
+    chunk_secs=2 \
+    left_context_secs=10.0 \
+    batch_size=32 \
+    clean_groundtruth_text=False
+```
+NVIDIA NIM for v2 parakeet model is available at [https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2](https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2).
 ## <span style="color:#466f00;">Software Integration:</span>