cdli/kenyan_english_nonstandard_speech_v1.0
Viewer • Updated • 6k • 17
How to use KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-full-a40 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-full-a40") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-full-a40")
model = AutoModelForMultimodalLM.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-full-a40")This model is a fine-tuned version of Sunbird/asr-whisper-large-v3-salt on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.7169 | 0.4248 | 250 | 0.8477 | 0.2903 | 0.1997 |
| 0.6641 | 0.8496 | 500 | 0.8096 | 0.2821 | 0.1963 |
| 0.547 | 1.2736 | 750 | 0.8075 | 0.2777 | 0.1892 |
| 0.5109 | 1.6984 | 1000 | 0.7924 | 0.2720 | 0.1875 |
| 0.4612 | 2.1223 | 1250 | 0.7972 | 0.2656 | 0.1796 |
| 0.4589 | 2.5472 | 1500 | 0.8042 | 0.2666 | 0.1818 |
| 0.5135 | 2.9720 | 1750 | 0.7934 | 0.2710 | 0.1859 |
| 0.445 | 3.3959 | 2000 | 0.8043 | 0.2657 | 0.1813 |
| 0.4636 | 3.8207 | 2250 | 0.8070 | 0.2677 | 0.1831 |
| 0.371 | 4.2447 | 2500 | 0.8112 | 0.2655 | 0.1812 |
| 0.4372 | 4.6695 | 2750 | 0.8133 | 0.2672 | 0.1824 |
| 0.3861 | 5.0935 | 3000 | 0.8134 | 0.2654 | 0.1807 |