File size: 1,700 Bytes
c5fc182
 
 
 
 
 
81f1dc6
57712e4
c5fc182
 
 
 
 
 
81f1dc6
 
 
c5fc182
81f1dc6
c5fc182
 
 
57712e4
c5fc182
 
 
81f1dc6
c5fc182
 
 
 
 
 
 
57712e4
 
c5fc182
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
---
license: apache-2.0
tags:
- automatic-speech-recognition
- whisper
- windyword
- hindi
- hi
library_name: transformers
pipeline_tag: automatic-speech-recognition
language:
- hi
---

# WindyWord.ai STT — Hindi Lingua (CPU INT8 (CTranslate2))

**Transcribes Hindi speech (Indo-European > Indo-Iranian > Indo-Aryan).**

> **Note:** Outputs Hindi audio as **Latin-script Hinglish, NOT Devanagari**. FLEURS-Devanagari WER ≈100% is a script mismatch, not a quality failure. Useful for code-switched / chat / SMS contexts. For Devanagari output, use a separate model (not yet shipped).

## Quality

- **WER:** unverified by WindyWord harness yet. Imported from upstream community fine-tune.

## About this variant

This is the **ct2-int8** deployment format of our Hindi Lingua STT model. Load it via the `ct2-int8/` subfolder.

Part of the [WindyWord.ai](https://windyword.ai) STT fleet — covering 35+ languages that commercial speech-to-text APIs underserve, with proper dialect / script disclosures where they matter.

## Usage

```python
from transformers import WhisperForConditionalGeneration, WhisperProcessor
processor = WhisperProcessor.from_pretrained("WindyWord/listen-windy-lingua-hi-ct2", subfolder="ct2-int8")
model = WhisperForConditionalGeneration.from_pretrained("WindyWord/listen-windy-lingua-hi-ct2", subfolder="ct2-int8")
```

## Commercial Use

Visit [windyword.ai](https://windyword.ai) for apps and API access.

---

## Provenance & License

Weights derived from upstream community Whisper fine-tunes (see individual model card for exact lineage). Redistributed under Apache-2.0 (inherited).

*Certified by Opus 4.6 Opus-Claw (Dr. C) on Veron-1 (RTX 5090, Mt Pleasant SC).*