nvidia
/

parakeet-tdt-0.6b-v2

Automatic Speech Recognition

hf-asr-leaderboard

Eval Results (legacy)

Model card Files Files and versions

nithinraok commited on Apr 13

Commit

1b149a3

·

verified ·

1 Parent(s): 48b630d

Update README.md

Files changed (1) hide show

README.md +60 -0

README.md CHANGED Viewed

@@ -268,6 +268,66 @@ for stamp in segment_timestamps:
     print(f"{stamp['start']}s - {stamp['end']}s : {stamp['segment']}")
 ```
 ## <span style="color:#466f00;">Software Integration:</span>

     print(f"{stamp['start']}s - {stamp['end']}s : {stamp['segment']}")
 ```
+## <span style="color:#466f00;">Try via API — No Setup Required</span>
+Transcribe audio instantly using the free hosted API on [build.nvidia.com](https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2) — no GPU, no Docker, no model download needed.
+**1. Get a free API key:** Visit [build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2](https://build.nvidia.com/nvidia/parakeet-tdt-0_6b-v2) and click **Get API Key**
+**2. Install the Riva client:**
+```bash
+pip install nvidia-riva-client
+```
+**3. Transcribe an audio file:**
+```python
+import riva.client
+auth = riva.client.Auth(
+    uri="grpc.nvcf.nvidia.com:443",
+    use_ssl=True,
+    metadata_args=[
+        ["function-id", "d3fe9151-442b-4204-a70d-5fcc597fd610"],
+        ["authorization", "Bearer nvapi-YOUR_API_KEY"]
+    ]
+)
+asr_service = riva.client.ASRService(auth)
+with open("audio.wav", "rb") as f:
+    audio = f.read()
+config = riva.client.RecognitionConfig(
+    language_code="en-US",
+    max_alternatives=1,
+    enable_automatic_punctuation=True,
+    enable_word_time_offsets=True,
+)
+response = asr_service.offline_recognize(audio, config)
+print(response.results[0].alternatives[0].transcript)
+```
+**Or use the CLI:**
+```bash
+git clone https://github.com/nvidia-riva/python-clients.git
+export NVIDIA_API_KEY="nvapi-YOUR_API_KEY"
+python python-clients/scripts/asr/transcribe_file_offline.py \
+    --server grpc.nvcf.nvidia.com:443 --use-ssl \
+    --metadata function-id "d3fe9151-442b-4204-a70d-5fcc597fd610" \
+    --metadata "authorization" "Bearer $NVIDIA_API_KEY" \
+    --language-code en-US \
+    --word-time-offsets --automatic-punctuation \
+    --input-file audio.wav
+```
+> **Note:** The hosted API accepts 16-bit mono audio in WAV, OGG, or OPUS format. See the [API Reference](https://docs.nvidia.com/nim/riva/asr/latest/protos.html) for streaming and advanced options.
 ## <span style="color:#466f00;">Software Integration:</span>