---
license: apache-2.0
datasets:
- Chillarmo/common_voice_20_armenian
language:
- hy
base_model:
- openai/whisper-small
pipeline_tag: automatic-speech-recognition
library_name: transformers
model-index:
- name: whisper-small-armenian
  results:
  - task:
      type: automatic-speech-recognition
      name: Automatic Speech Recognition
    dataset:
      name: Common Voice 20 Armenian
      type: Chillarmo/common_voice_20_armenian
    metrics:
    - type: wer
      value: 29.986160233738275
      name: Word Error Rate
    - type: cer
      value: 5.967044631948796
      name: Character Error Rate
    - type: exact_match
      value: 19.023224043715846
      name: Exact Match
metrics:
- wer
- cer
- exact_match
new_version: Chillarmo/whisper-small-armenian-v2
---

# Whisper-Small Fine-tuned for Armenian ASR

This model is OpenAI's Whisper-small fine-tuned for Armenian automatic speech recognition (ASR) on the Common Voice 20 Armenian dataset (`Chillarmo/common_voice_20_armenian`).

## Training Results

The model was trained for 5,000 steps (5.34 epochs). Final training metrics:

| Metric | Value |
|--------|-------|
| **Training Loss** | 0.101 |
| **Training Runtime** | 59,610 seconds (≈16.56 hours) |
| **Training Samples/Second** | 1.34 |
| **Training Steps/Second** | 0.084 |
| **Total Training Steps** | 5,000 |
| **Epochs** | 5.34 |
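
The throughput figures in the table can be cross-checked against each other with a little arithmetic. The sketch below uses only the reported numbers; the implied samples per step and samples per epoch are inferences from those rounded values, not figures stated in this card.

```python
# Figures copied from the training table above.
runtime_s = 59_610
total_steps = 5_000
samples_per_s = 1.34
epochs = 5.34

hours = runtime_s / 3600                 # wall-clock training time in hours
steps_per_s = total_steps / runtime_s    # should agree with the reported 0.084

# Inferred, not reported: effective samples per optimizer step and per epoch.
samples_per_step = samples_per_s / steps_per_s
samples_per_epoch = samples_per_s * runtime_s / epochs

print(f"{hours:.2f} h, {steps_per_s:.3f} steps/s")
print(f"~{samples_per_step:.0f} samples/step, ~{samples_per_epoch:.0f} samples/epoch")
```

Because the inputs are rounded, the inferred values are approximate and should be read as a sanity check rather than as the actual training configuration.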

## Evaluation Results

| Metric | Value |
|--------|-------|
| **Evaluation Loss** | 0.165 |
| **Word Error Rate (WER)** | 29.43% |
| **Character Error Rate (CER)** | 5.89% |
| **Exact Match** | 20.05% |
| **Average Prediction Length** | 7.71 tokens |
| **Average Label Length** | 7.77 tokens |
| **Length Ratio** (prediction / label) | 0.992 |
| **Evaluation Runtime** | 9,486 seconds (≈2.63 hours) |
| **Evaluation Samples/Second** | 0.62 |
| **Evaluation Steps/Second** | 0.077 |
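
For readers unfamiliar with the metrics above, here is a minimal pure-Python sketch of how WER, CER, and exact match are conventionally defined. It is an illustration only: this card does not say which library, text normalization, or aggregation (corpus-level versus per-utterance average) produced the reported numbers, so the corpus-level aggregation and the helper names `edit_distance` and `score` are assumptions.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or word lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def score(references, predictions):
    """Corpus-level WER, CER and exact match, all returned as percentages."""
    word_errs = word_total = char_errs = char_total = exact = 0
    for ref, hyp in zip(references, predictions):
        word_errs += edit_distance(ref.split(), hyp.split())
        word_total += len(ref.split())
        char_errs += edit_distance(ref, hyp)
        char_total += len(ref)
        exact += ref == hyp
    return {
        "wer": 100 * word_errs / word_total,
        "cer": 100 * char_errs / char_total,
        "exact_match": 100 * exact / len(references),
    }


# Toy example: the second prediction drops one character from the reference.
refs = ["բարի լույս", "շնորհակալություն"]
hyps = ["բարի լույս", "շնորհակալութուն"]
print(score(refs, hyps))

# The reported length ratio matches the ratio of the two average lengths.
print(round(7.71 / 7.77, 3))
```

A single wrong character makes the whole word count as an error for WER and the whole utterance count as a miss for exact match, which is why a CER near 6% can coexist with a WER near 29% and an exact match near 20%.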

## Model Details

- **Base Model**: openai/whisper-small
- **Language**: Armenian (hy)
- **Dataset**: Chillarmo/common_voice_20_armenian
- **License**: Apache 2.0
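
A minimal inference sketch with the `transformers` ASR pipeline follows. The repository ID `Chillarmo/whisper-small-armenian` is an assumption inferred from the model name in the metadata and is not stated in this card; `transcribe` and the audio path are hypothetical names for illustration. Decoding an audio file by path typically requires `ffmpeg` to be installed.

```python
# Assumed repo ID; not stated in this card. The metadata also lists a newer
# version, "Chillarmo/whisper-small-armenian-v2", which could be passed instead.
MODEL_ID = "Chillarmo/whisper-small-armenian"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe one audio file to Armenian text."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    # Hint the decoder towards Armenian transcription; this argument is optional.
    result = asr(audio_path, generate_kwargs={"language": "hy", "task": "transcribe"})
    return result["text"]


# Example call (downloads the model on first use; "sample.wav" is a placeholder):
# print(transcribe("sample.wav"))
```

Building the pipeline inside the function keeps the sketch short; for batch transcription it would be cheaper to construct the pipeline once and reuse it.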