How to use razhan/whisper-base-hawrami-transcription with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="razhan/whisper-base-hawrami-transcription")

# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

processor = AutoProcessor.from_pretrained("razhan/whisper-base-hawrami-transcription")
model = AutoModelForSpeechSeq2Seq.from_pretrained("razhan/whisper-base-hawrami-transcription")

This model is a fine-tuned version of razhan/whisper-base-hawrami on the razhan/DOLMA-speech Hawrami dataset. After four epochs of training, it reaches a validation loss of 0.2612, a WER of 0.4013, and a CER of 0.0856 on the evaluation set.
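The loading snippet above stops before any audio is transcribed. The following is a minimal sketch of how the pipeline might be applied to a file. The `transcribe` helper, the `MODEL_ID` constant, and the 30-second chunk length are illustrative assumptions, not part of the model card; decoding an audio file by path also assumes `ffmpeg` is available on the system.

```python
MODEL_ID = "razhan/whisper-base-hawrami-transcription"


def transcribe(audio_path: str, chunk_length_s: float = 30.0) -> str:
    """Transcribe one audio file with the fine-tuned Whisper model.

    `transcribe` is a hypothetical helper written for illustration.
    """
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    # chunk_length_s splits long recordings into windows; 30 s is an assumed
    # value matching Whisper's native input length, not a documented setting.
    asr = pipeline(
        "automatic-speech-recognition",
        model=MODEL_ID,
        chunk_length_s=chunk_length_s,
    )
    result = asr(audio_path)  # the pipeline returns a dict with a "text" key
    return result["text"].strip()
```

A call such as `transcribe("sample.wav")` (with a placeholder file name) would download the model on first use and return the transcript as a string.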
Training results:
| Training Loss | Epoch | Step | Validation Loss | WER | CER |
|---|---|---|---|---|---|
| 0.6798 | 1.0 | 80 | 0.5513 | 0.6855 | 0.1788 |
| 0.3095 | 2.0 | 160 | 0.2984 | 0.4486 | 0.0972 |
| 0.2673 | 3.0 | 240 | 0.2676 | 0.4143 | 0.0882 |
| 0.2428 | 4.0 | 320 | 0.2612 | 0.4013 | 0.0856 |
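For readers unfamiliar with the last two columns, word error rate (WER) and character error rate (CER) are both edit distances normalized by the reference length, computed over words and characters respectively. The sketch below shows that definition in plain Python. It is an illustration only: the model card does not say which library or text normalization produced the numbers above, and the function names here are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions,
    insertions, and deletions each cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character-level edit distance divided by the reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

Because a single wrong character makes the whole word count as an error, CER is typically much lower than WER for the same transcripts, which matches the pattern in the table.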
Base model
openai/whisper-base