Committed by nithinraok
Commit · bb0964b
1 Parent(s): 7938c10
Add streaming inference info
Signed-off-by: nithinraok <nithinrao.koluguri@gmail.com>
README.md CHANGED
@@ -927,8 +927,25 @@ output = asr_model.transcribe(['2086-149220-0033.wav'])
 print(output[0].text)
 ```
 
+#### Streaming with Parakeet models
 
+To use parakeet models in streaming mode use this [script](https://github.com/NVIDIA/NeMo/blob/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py) as shown below:
 
+```bash
+python NeMo/main/examples/asr/asr_chunked_inference/rnnt/speech_to_text_streaming_infer_rnnt.py \
+pretrained_name="nvidia/parakeet-tdt-0.6b-v3" \
+model_path=null \
+audio_dir="<optional path to folder of audio files>" \
+dataset_manifest="<optional path to manifest>" \
+output_filename="<optional output filename>" \
+right_context_secs=2.0 \
+chunk_secs=2 \
+left_context_secs=10.0 \
+batch_size=32 \
+clean_groundtruth_text=False
+```
+
+NVIDIA NIM for v2 parakeet model is available at [https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2](https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2).
 
 ## <span style="color:#466f00;">Software Integration:</span>
 
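Reviewer note: the relationship between `chunk_secs`, `left_context_secs`, and `right_context_secs` in the added command may be easier to follow with a small sketch. The Python below is not taken from the NeMo script; it is a minimal illustration that assumes the audio is decoded in consecutive chunks, each seen together with some past (left) and future (right) audio. `Window` and `plan_windows` are hypothetical names introduced only for this example, and the real script may treat the first and last chunks differently.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Window:
    """One decoding step; all values are offsets in seconds from the start of the audio."""
    left_start: float   # where the left (past) context begins
    chunk_start: float  # start of the chunk whose output is kept
    chunk_end: float    # end of the chunk whose output is kept
    right_end: float    # where the right (future) context ends


def plan_windows(
    duration: float,
    chunk_secs: float = 2.0,
    left_context_secs: float = 10.0,
    right_context_secs: float = 2.0,
) -> List[Window]:
    """Split `duration` seconds of audio into chunks with clipped left/right context.

    Hypothetical helper: it only illustrates the windowing idea and does not
    reproduce the buffering logic of the NeMo streaming script.
    """
    windows = []
    start = 0.0
    while start < duration:
        end = min(start + chunk_secs, duration)
        windows.append(
            Window(
                left_start=max(0.0, start - left_context_secs),
                chunk_start=start,
                chunk_end=end,
                right_end=min(duration, end + right_context_secs),
            )
        )
        start = end
    return windows


if __name__ == "__main__":
    # Print the windows for a 7-second clip using the values from the command above.
    for window in plan_windows(7.0):
        print(window)
```

Under this assumed scheme, each chunk has to wait for its right context before it can be decoded, so the delay per chunk is roughly `chunk_secs + right_context_secs` (about 4 seconds with the values in the command). A larger `left_context_secs` only adds past audio and would not increase that delay.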