MedASR-Bench / README.md
ngoan
MedASR Bench: full platform + Hugging Face Docker Space packaging
d70f132
|
Raw
History Blame Contribute Delete
1.35 kB
---
title: MedASR Bench
emoji: 🩺
colorFrom: indigo
colorTo: purple
sdk: docker
app_port: 3000
pinned: false
license: mit
short_description: Vietnamese medical ASR benchmark, code-switching aware
---
# MedASR Bench
A public benchmark for **Vietnamese medical speech recognition with English
code-switching**, anchored on the [ViMedCSS](https://huggingface.co/datasets/tensorxt/ViMedCSS)
dataset. Part of the **Clinical Scribe** program.
This Space is the **display layer** only — a stateless Next.js app that renders
the leaderboard, per-model pages, and methodology. It runs on free CPU; no audio
and no model inference ever happen here. Benchmark numbers are produced by the
offline harness (`benchmark/medasr_eval.py`) and published as versioned,
auditable run manifests.
## Metrics
WER · CER · CS-WER (code-switched-term WER) · N-WER · MTR (medical-term
recall), each with bootstrap 95% CIs under a pinned text normalizer.
## Running locally
```bash
npm install
npm run dev # http://localhost:3000
```
## Build / deploy
This Space is a Docker SDK Space. The image builds the Next.js
[`standalone`](https://nextjs.org/docs/app/api-reference/config/next-config-js/output)
output and serves it with `node server.js` as uid 1000 on port 3000.
```bash
docker build -t medasr-bench .
docker run --rm -p 3000:3000 medasr-bench
```