nithinraok committed
Commit bb0964b · 1 parent: 7938c10

Add streaming inference info

Signed-off-by: nithinraok <[email protected]>

README.md CHANGED
@@ -927,8 +927,25 @@ output = asr_model.transcribe(['2086-149220-0033.wav'])
 print(output[0].text)
 ```

+#### Streaming with Parakeet models

+To use parakeet models in streaming mode use this [script](https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py) as shown below:

+```bash
+python NeMo/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py \
+    pretrained_name="nvidia/parakeet-tdt-0.6b-v3" \
+    model_path=null \
+    audio_dir="<optional path to folder of audio files>" \
+    dataset_manifest="<optional path to manifest>" \
+    output_filename="<optional output filename>" \
+    right_context_secs=2.0 \
+    chunk_secs=2 \
+    left_context_secs=10.0 \
+    batch_size=32 \
+    clean_groundtruth_text=False
+```
+
+NVIDIA NIM for v2 parakeet model is available at [https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2](https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2).

 ## <span style="color:#466f00;">Software Integration:</span>

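Review note: the three context flags added in this commit (`left_context_secs`, `chunk_secs`, `right_context_secs`) are easier to reason about with a little arithmetic. The sketch below is not taken from NeMo; `StreamingWindow` is a hypothetical helper, and it assumes that each decoding step sees left context + chunk + right context, and that a chunk can only be decoded once its right context has arrived. Under those assumptions, the values in the command give a 14-second window and roughly 4 seconds of look-ahead latency.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class StreamingWindow:
    """Hypothetical helper (not part of NeMo) mirroring the three context flags."""

    left_context_secs: float
    chunk_secs: float
    right_context_secs: float

    @property
    def window_secs(self) -> float:
        # Assumption: one decoding step spans left context + chunk + right context.
        return self.left_context_secs + self.chunk_secs + self.right_context_secs

    @property
    def lookahead_latency_secs(self) -> float:
        # Assumption: a chunk is decoded only after it and its right context arrive.
        return self.chunk_secs + self.right_context_secs

    def num_chunks(self, audio_secs: float) -> int:
        # Number of chunk-sized steps needed to cover the whole recording.
        return math.ceil(audio_secs / self.chunk_secs)


# Values copied from the command added in this commit.
window = StreamingWindow(left_context_secs=10.0, chunk_secs=2.0, right_context_secs=2.0)
print(f"window per step: {window.window_secs} s")
print(f"look-ahead latency: {window.lookahead_latency_secs} s")
print(f"steps for a 9 s file: {window.num_chunks(9.0)}")
```

If these assumptions hold, lowering `right_context_secs` or `chunk_secs` trades accuracy for latency, while `left_context_secs` mainly affects memory and compute per step.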