Automatic Speech Recognition
Transformers
NeMo
Safetensors
PyTorch
parakeet_tdt
feature-extraction
speech
audio
Transducer
Transformer
TDT
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Transformers
Eval Results (legacy)
Eval Results
Instructions to use nvidia/parakeet-tdt-0.6b-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use nvidia/parakeet-tdt-0.6b-v3 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="nvidia/parakeet-tdt-0.6b-v3")# Load model directly from transformers import AutoModelForMultimodalLM model = AutoModelForMultimodalLM.from_pretrained("nvidia/parakeet-tdt-0.6b-v3", dtype="auto") - Inference
- Notebooks
- Google Colab
- Kaggle
A quantized and deployable version of the model on Nvidia Orins with TensorRT
#44 opened 23 days ago
by
quantshah
The model sometimes drops full sentences
#42 opened about 1 month ago
by
nenad1002
CUDA out of memory with long audio
#41 opened about 2 months ago
by
CarlsMM7
Model support per CrispASR β pure C++ inference with GGUF (no Python/NeMo needed)
#38 opened about 2 months ago
by
cstr
Add Open ASR Leaderboard evaluation results
#36 opened 2 months ago
by
SaylorTwift
Environment setup
1
#35 opened 2 months ago
by
pr-tet-usr
Inconsistent number transcription - often letters instead of digits (french language)
#34 opened 3 months ago
by
poulpor
requirements.txt ? What Python version does this need?
2
#33 opened 3 months ago
by
Jhaut
eror timestamp
1
#32 opened 4 months ago
by
ghoza
Word boosting/Custom vocabulary
π₯ 4
1
#31 opened 5 months ago
by
buzzb0x
I want transcription not translation
π 5
1
#29 opened 6 months ago
by
smartire
Update Readme
#27 opened 7 months ago
by
jbalam-nv
How to specify the output language?
π 4
1
#26 opened 7 months ago
by
dragonhunterau
Will there be support for other languages?
ππ 9
2
#25 opened 7 months ago
by
altunenes
First 273 vocabulary tokens
#24 opened 7 months ago
by
comodoro
Does it support Realtime ?
1
#23 opened 7 months ago
by
ism0il
Separate languages into distinct models. How?
#22 opened 7 months ago
by
mv24
Running parakeet-tdt-0.6b-v3 on Jetson AGX Orin, Thor, or Spark!
4
#21 opened 7 months ago
by
raymondlo84
Seeking a Clear Tutorial for Fine-Tuning NVIDIA NeMo Models on New English Audio Domains
1
#19 opened 8 months ago
by
jacktol
Streaming question
1
#18 opened 8 months ago
by
koifish12
training script
1
#17 opened 8 months ago
by
sugintama
[EXAMPLE] Working streaming POC with Gradio MIC input.
#16 opened 8 months ago
by
WJ88
Can support for Irish Gaelic be added?
1
#15 opened 9 months ago
by
cgiwouter
Fine-tune on the other Language
3
#14 opened 9 months ago
by
Chonlasitk
Questions about streaming with Parakeet and TDT merging methods
π 1
2
#13 opened 9 months ago
by
alexandreacff
Missing sentences when transcribe some audios
β 8
#12 opened 9 months ago
by
josscii
Streaming?
1
#11 opened 9 months ago
by
dyqiang
Question about inference speed
1
#10 opened 10 months ago
by
cX1y
Async streaming container?
#9 opened 10 months ago
by
lukiggs
Is it possible to prompt or output language?
ππ 8
1
#8 opened 10 months ago
by
ndlc
Local Installation Video and Testing - Step by Step
β€οΈ 2
1
#6 opened 10 months ago
by
fahdmirzac
Japanese support plan?
π 5
6
#5 opened 10 months ago
by
sttt
Recognise separate voices
1
#4 opened 10 months ago
by
Jappie
Word boosting
βπ 2
2
#3 opened 10 months ago
by
stefanr123
training hyper-parameters
#2 opened 10 months ago
by
sugintama
Code switching
π 1
1
#1 opened 10 months ago
by
pscar