CDLI
Collection
This is a collection of models used for the CDLI ASR challenge for atypical speech in Uganda on Ugandan English and Luganda. • 26 items • Updated
How to use KasuleTrevor/cdli-whisper-en-ug-openai-large-v3-full-a40-fixed-nogc-clean with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="KasuleTrevor/cdli-whisper-en-ug-openai-large-v3-full-a40-fixed-nogc-clean") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-openai-large-v3-full-a40-fixed-nogc-clean")
model = AutoModelForMultimodalLM.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-openai-large-v3-full-a40-fixed-nogc-clean")This model is a fine-tuned version of openai/whisper-large-v3 on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.5356 | 1.5445 | 250 | 0.7888 | 0.3111 | 0.2203 |
| 0.4352 | 3.0866 | 500 | 0.7592 | 0.2968 | 0.2120 |
| 0.3735 | 4.6311 | 750 | 0.7697 | 0.3011 | 0.2153 |
| 0.2898 | 6.1732 | 1000 | 0.7953 | 0.2971 | 0.2138 |
| 0.2843 | 7.7177 | 1250 | 0.8035 | 0.2980 | 0.2136 |
| 0.2508 | 9.2599 | 1500 | 0.8364 | 0.2963 | 0.2133 |
Base model
openai/whisper-large-v3