---
language:
- hi
- en
license: apache-2.0
library_name: f5-tts
tags:
- text-to-speech
- indicf5
- hinglish
- hindi-english
- multilingual-tts
- audio
- speech
datasets:
- openslr-org/slr104
base_models:
- ai4bharat/IndicF5
---

# IndicF5-Hinglish

Fine-tuned [IndicF5](https://huggingface.co/ai4bharat/IndicF5) TTS model for **Hindi-English code-switched (Hinglish)** speech synthesis.

## Quick Start

```bash
# 1. Clone the original fine-tuning repo
git clone https://github.com/saravananravi08/indicf5-finetune
cd indicf5-finetune

# 2. Install dependencies
uv sync

# 3. Download this model from HuggingFace
huggingface-cli download Saravananravi/indicf5-hinglish model.safetensors   --local-dir ./checkpoints

# 4. Generate speech
uv run evaluate --checkpoint-dir ./checkpoints
```

## Usage Example

```python
from indicf5_finetune.evaluate import load_model_from_checkpoint

model, step = load_model_from_checkpoint(
    checkpoint_dir="./checkpoints",
    vocab_path="path/to/vocab.txt"
)

# Synthesize
audio = model.generate("मैं आज office जा रहा हूँ")
```

## Requirements

- Python 3.10+
- PyTorch 2.2+
- f5_tts (bundled in the repo)
- Vocos vocoder
- CUDA GPU (recommended, works on CPU with reduced speed)

## Model Details

| Property | Value |
|----------|-------|
| Base Model | ai4bharat/IndicF5 |
| Training | OpenSLR-104 (Hindi) |
| Steps | 32,990 |
| Language | Hindi + English (Hinglish) |
| Sample Rate | 24kHz |

## License

Apache 2.0

## Citation

```bibtex
@software{indicf5_hinglish,
  title={IndicF5-Hinglish},
  author={Saravanan Ravi},
  year={2025},
  url={https://huggingface.co/Saravananravi/indicf5-hinglish}
}
```