πŸŽ™οΈ ASR Workspace

Place ASR baselines, training configs, and evaluation scripts here.

✅ Verified Pashto-Relevant ASR Models

🧠 OpenAI Whisper Large v3

🌐 Meta MMS Coverage (ASR + TTS language support)

  • Coverage page: MMS language coverage
  • Pashto validation: the coverage table includes a row for pus (Pashto) with both ASR and TTS support.
  • Use in this repo: multilingual transfer baseline when Pashto data is limited.
  • Applications: low-resource ASR transfer experiments.

βš™οΈ Verified Inference Tooling

🚀 Faster-Whisper

  • Repo: github.com/SYSTRAN/faster-whisper
  • Why useful: reimplements Whisper inference on CTranslate2, which speeds up transcription and shortens experiment turnaround.
  • Use in this repo: local transcription pipelines and benchmark generation speedups.
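
A local transcription step with Faster-Whisper might look like the sketch below. It assumes the `faster-whisper` package is installed and that the `large-v3` weights can be downloaded on first use. The helper names `format_segments` and `transcribe_file`, and the CPU/`int8` settings, are illustrative assumptions rather than part of this repo; `ps` is Whisper's language code for Pashto.

```python
from typing import Iterable, List


def format_segments(segments: Iterable) -> List[str]:
    """Render segments (objects with .start, .end, .text) as timestamped lines."""
    return [f"[{s.start:.2f} -> {s.end:.2f}] {s.text.strip()}" for s in segments]


def transcribe_file(path: str, model_size: str = "large-v3") -> List[str]:
    """Transcribe one audio file with Faster-Whisper (hypothetical helper)."""
    # Imported lazily so format_segments works without the package installed.
    from faster_whisper import WhisperModel

    # Assumed settings: CPU inference with int8 quantization.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # transcribe() returns a lazy generator of segments plus an info object;
    # decoding only happens while the generator is consumed.
    segments, _info = model.transcribe(path, language="ps", beam_size=5)
    return format_segments(segments)
```

Calling `transcribe_file("sample.wav")` (a placeholder path) returns one timestamped line per segment. On a GPU machine, `device="cuda"` with `compute_type="float16"` is the usual alternative.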

🧩 Integration Hints

  • Keep all model/eval runs reproducible with command logs and commit hashes.
  • Store evaluation outputs under benchmarks/ with model/version labels.
  • Report WER/CER together with the dataset split and the text normalization policy used.
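
The hints above can be sketched as a small scoring helper. This is a minimal illustration, not this repo's evaluation script: the normalization policy (Unicode NFC plus whitespace collapsing), the JSON field names, and the `model_version_split.json` file naming are all assumptions. CER here counts spaces as characters.

```python
import json
import subprocess
import unicodedata
from pathlib import Path
from typing import Sequence


def normalize(text: str) -> str:
    """Assumed normalization policy: Unicode NFC, then collapse whitespace."""
    return " ".join(unicodedata.normalize("NFC", text).split())


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance over tokens (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def wer(ref: str, hyp: str) -> float:
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = normalize(ref).split()
    return edit_distance(ref_words, normalize(hyp).split()) / max(len(ref_words), 1)


def cer(ref: str, hyp: str) -> float:
    """Character error rate: character-level edits over reference length."""
    ref_chars = list(normalize(ref))
    return edit_distance(ref_chars, list(normalize(hyp))) / max(len(ref_chars), 1)


def current_commit() -> str:
    """Return the current git commit hash, or 'unknown' outside a repo."""
    try:
        out = subprocess.run(["git", "rev-parse", "HEAD"],
                             capture_output=True, text=True, check=True)
        return out.stdout.strip()
    except (OSError, subprocess.CalledProcessError):
        return "unknown"


def save_result(model: str, version: str, split: str, ref: str, hyp: str,
                root: str = "benchmarks") -> Path:
    """Write one labeled result file; model/version must be path-safe strings."""
    record = {
        "model": model, "version": version, "split": split,
        "normalization": "NFC + whitespace collapse",
        "commit": current_commit(),
        "wer": wer(ref, hyp), "cer": cer(ref, hyp),
    }
    out_path = Path(root) / f"{model}_{version}_{split}.json"
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text(json.dumps(record, ensure_ascii=False, indent=2),
                        encoding="utf-8")
    return out_path
```

Storing the split, normalization policy, and commit hash next to each score keeps numbers from different runs comparable. For corpus-level reporting, the edit counts and reference lengths would normally be summed across utterances before dividing, rather than averaging per-utterance rates.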