Doing More with Less: Data Augmentation for Sudanese Dialect
Collection
22 items • Updated
How to use AymanMansour/Whisper-Sudanese-Dialect-large-v2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="AymanMansour/Whisper-Sudanese-Dialect-large-v2") # Load model directly
from transformers import AutoProcessor, AutoModelForMultimodalLM
processor = AutoProcessor.from_pretrained("AymanMansour/Whisper-Sudanese-Dialect-large-v2")
model = AutoModelForMultimodalLM.from_pretrained("AymanMansour/Whisper-Sudanese-Dialect-large-v2")This model is a fine-tuned version of openai/whisper-large-v2 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.5167 | 1.08 | 1000 | 0.7033 | 67.2465 |
| 0.0886 | 3.04 | 2000 | 0.7730 | 51.1880 |
| 0.0808 | 4.12 | 3000 | 0.7812 | 52.5880 |
| 0.0232 | 6.08 | 4000 | 0.8798 | 40.8570 |
| 0.001 | 8.04 | 5000 | 0.9317 | 41.0267 |