CDLI
Collection
This is a collection of models used for the CDLI ASR challenge for atypical speech in Uganda on Ugandan English and Luganda. • 26 items • Updated
How to use KasuleTrevor/cdli-whisper-en-ug-sunbird-encoder-a40 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="KasuleTrevor/cdli-whisper-en-ug-sunbird-encoder-a40") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-sunbird-encoder-a40")
model = AutoModelForMultimodalLM.from_pretrained("KasuleTrevor/cdli-whisper-en-ug-sunbird-encoder-a40")This model is a fine-tuned version of Sunbird/asr-whisper-large-v3-salt on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.7206 | 0.7734 | 250 | 0.9708 | 0.3777 | 0.2721 |
| 0.5798 | 1.5445 | 500 | 0.9283 | 0.3481 | 0.2501 |
| 0.5444 | 2.3155 | 750 | 0.9325 | 0.3467 | 0.2508 |
| 0.5114 | 3.0866 | 1000 | 0.9377 | 0.3461 | 0.2485 |
| 0.4933 | 3.8600 | 1250 | 0.9307 | 0.3341 | 0.2398 |
| 0.4684 | 4.6311 | 1500 | 0.9362 | 0.3433 | 0.2463 |
| 0.4699 | 5.4022 | 1750 | 0.9397 | 0.3278 | 0.2327 |
| 0.4224 | 6.1732 | 2000 | 0.9451 | 0.3331 | 0.2363 |
| 0.4961 | 6.9466 | 2250 | 0.9438 | 0.3339 | 0.2372 |
| 0.4316 | 7.7177 | 2500 | 0.9443 | 0.3339 | 0.2373 |