nvidia
/

parakeet-tdt-0.6b-v2

Automatic Speech Recognition

hf-asr-leaderboard

Eval Results (legacy)

Model card Files Files and versions

nithinraok commited on May 16, 2025

Commit

a875816

·

verified ·

1 Parent(s): 17d6235

Update README.md

Update wording and fix GPU count.

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -290,8 +290,8 @@ Current version: parakeet-tdt-0.6b-v2. Previous versions can be [accessed](https
 This model was trained using the NeMo toolkit [3], following the strategies below:
-- Initialized from a wav2vec SSL checkpoint pretrained on the LibriLight dataset[7].
-- Trained for 150,000 steps on 128 A100 GPUs.
 - Dataset corpora were balanced using a temperature sampling value of 0.5.
 - Stage 2 fine-tuning was performed for 2,500 steps on 4 A100 GPUs using approximately 500 hours of high-quality, human-transcribed data of NeMo ASR Set 3.0.

 This model was trained using the NeMo toolkit [3], following the strategies below:
+- Initialized from a FastConformer SSL checkpoint that was pretrained with a wav2vev method on the LibriLight dataset[7].
+- Trained for 150,000 steps on 64 A100 GPUs.
 - Dataset corpora were balanced using a temperature sampling value of 0.5.
 - Stage 2 fine-tuning was performed for 2,500 steps on 4 A100 GPUs using approximately 500 hours of high-quality, human-transcribed data of NeMo ASR Set 3.0.