---
language: bxk
tags:
- audio
- automatic-speech-recognition
- mms
- adapter
license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_spontaneous_speech
---

# MMS Adapter Fine-tuned for Bukusu

This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all), trained on the Bukusu (bxk) portion of the Mozilla Common Voice Spontaneous Speech dataset.

## Training

- Base model: facebook/mms-1b-all
- Fine-tuning method: Adapter layers
- Dataset: Mozilla Common Voice Spontaneous Speech

## Usage

```python
import librosa
import torch
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

model_id = "vitthalbhandari/mms-1b-all-aft-mid-bxk"
processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

# Load the Bukusu adapter weights
model.load_adapter("bxk")

# Load mono audio resampled to 16 kHz ("audio.wav" is a placeholder path)
audio_array, _ = librosa.load("audio.wav", sr=16000)

# Transcribe audio
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Greedy decoding: pick the most likely token for each frame
predicted_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(predicted_ids)
print(transcription[0])
```
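
The usage snippet takes the per-frame `argmax` of the logits and hands the resulting ids to `processor.batch_decode`. For a CTC model, decoding then merges consecutive repeated ids and drops the blank token. The following is a minimal, dependency-free sketch of that collapse step, not the library's actual implementation. The function name `ctc_greedy_collapse`, the sample ids, and `blank_id=0` are illustrative assumptions; with Wav2Vec2-style tokenizers the blank is typically the padding token, whose id depends on the vocabulary.

```python
from itertools import groupby


def ctc_greedy_collapse(frame_ids, blank_id=0):
    """Collapse a per-frame argmax sequence into CTC output token ids.

    frame_ids: iterable of int token ids, one per audio frame.
    blank_id:  id of the CTC blank token (assumed to be 0 here).
    """
    # Step 1: merge each run of identical consecutive ids into a single id.
    merged = [token for token, _ in groupby(frame_ids)]
    # Step 2: remove blanks, which only mark "no new token at this frame".
    return [token for token in merged if token != blank_id]


# Hypothetical frame-level predictions: blank, 5, 5, blank, 5, 7, 7, blank
frames = [0, 5, 5, 0, 5, 7, 7, 0]
print(ctc_greedy_collapse(frames))  # [5, 5, 7]
```

The blank between the two runs of `5` is what allows the same token to appear twice in a row in the output; without it, the repeats would merge into one.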