--- title: MedASR Bench emoji: 🩺 colorFrom: indigo colorTo: purple sdk: docker app_port: 3000 pinned: false license: mit short_description: Vietnamese medical ASR benchmark, code-switching aware --- # MedASR Bench A public benchmark for **Vietnamese medical speech recognition with English code-switching**, anchored on the [ViMedCSS](https://huggingface.co/datasets/tensorxt/ViMedCSS) dataset. Part of the **Clinical Scribe** program. This Space is the **display layer** only — a stateless Next.js app that renders the leaderboard, per-model pages, and methodology. It runs on free CPU; no audio and no model inference ever happen here. Benchmark numbers are produced by the offline harness (`benchmark/medasr_eval.py`) and published as versioned, auditable run manifests. ## Metrics WER · CER · CS-WER (code-switched-term WER) · N-WER · MTR (medical-term recall), each with bootstrap 95% CIs under a pinned text normalizer. ## Running locally ```bash npm install npm run dev # http://localhost:3000 ``` ## Build / deploy This Space is a Docker SDK Space. The image builds the Next.js [`standalone`](https://nextjs.org/docs/app/api-reference/config/next-config-js/output) output and serves it with `node server.js` as uid 1000 on port 3000. ```bash docker build -t medasr-bench . docker run --rm -p 3000:3000 medasr-bench ```