cdli/kenyan_english_nonstandard_speech_v1.0
Viewer • Updated • 6k • 25
How to use KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-encoder-a40 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-encoder-a40") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-encoder-a40")
model = AutoModelForMultimodalLM.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-ke-sunbird-encoder-a40")This model is a fine-tuned version of Sunbird/asr-whisper-large-v3-salt on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.8367 | 0.4248 | 250 | 0.9348 | 0.3188 | 0.2204 |
| 0.7745 | 0.8496 | 500 | 0.9045 | 0.3089 | 0.2130 |
| 0.6983 | 1.2736 | 750 | 0.9017 | 0.3057 | 0.2104 |
| 0.6503 | 1.6984 | 1000 | 0.8888 | 0.2970 | 0.2038 |
| 0.6562 | 2.1223 | 1250 | 0.8883 | 0.3022 | 0.2087 |
| 0.6649 | 2.5472 | 1500 | 0.8885 | 0.2978 | 0.2055 |
| 0.7165 | 2.9720 | 1750 | 0.8887 | 0.2976 | 0.2051 |
| 0.6558 | 3.3959 | 2000 | 0.8885 | 0.2958 | 0.2031 |
| 0.691 | 3.8207 | 2250 | 0.8895 | 0.2948 | 0.2019 |
| 0.5922 | 4.2447 | 2500 | 0.8885 | 0.2945 | 0.2021 |