Automatic Speech Recognition
Transformers
NeMo
Safetensors
PyTorch
parakeet_tdt
feature-extraction
speech
audio
Transducer
Transformer
TDT
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Transformers
Eval Results (legacy)
Eval Results
Instructions to use nvidia/parakeet-tdt-0.6b-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nvidia/parakeet-tdt-0.6b-v3 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="nvidia/parakeet-tdt-0.6b-v3")# Load model directly from transformers import AutoModelForMultimodalLM model = AutoModelForMultimodalLM.from_pretrained("nvidia/parakeet-tdt-0.6b-v3", dtype="auto") - Inference
- Notebooks
- Google Colab
- Kaggle
add title
Browse filesSigned-off-by: monica-sekoyan <msekoyan@nvidia.com>
README.md
CHANGED
|
@@ -819,6 +819,8 @@ This model is ready for commercial/non-commercial use.
|
|
| 819 |
|
| 820 |
---
|
| 821 |
|
|
|
|
|
|
|
| 822 |

|
| 823 |
|
| 824 |
*Figure 1: ASR WER comparison across different models. This does not include Punctuation and Capitalisation errors.*
|
|
|
|
| 819 |
|
| 820 |
---
|
| 821 |
|
| 822 |
+
## Automatic Speech Recognition (ASR) Performance
|
| 823 |
+
|
| 824 |

|
| 825 |
|
| 826 |
*Figure 1: ASR WER comparison across different models. This does not include Punctuation and Capitalisation errors.*
|