openslr/openslr
Updated • 461 • 29
How to use proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm")
model = AutoModelForCTC.from_pretrained("proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm")# Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm")
model = AutoModelForCTC.from_pretrained("proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm")This is a copy of the model diego-fustes/wav2vec2-large-xlsr-gl with an integrated language model.
Improvement This model has been compared with the baseline (diego-fustes/wav2vec2-large-xlsr-gl) on:
The results are shown in the following tables:
OpenSLR77:
| Model | WER | CER |
|---|---|---|
| diego-fustes/wav2vec2-large-xlsr-gl | 9.10% | 3.94% |
| cmagui/wav2vec2-large-xlsr-53-galician-with-lm | 6.86% | 2.20% |
Common_voice-gl:
| Model | WER | CER |
|---|---|---|
| diego-fustes/wav2vec2-large-xlsr-gl | 22.12% | 5.09% |
| cmagui/wav2vec2-large-xlsr-53-galician-with-lm | 15.20% | 3.87% |
Citation information
If you use this model, please cite as follows:
Magariños, Carmen; Gamallo, Pablo. 2023. Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm. URL: https://huggingface.co/proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm")