Spaces:

ngoan
/

MedASR-Bench

Sleeping

MedASR-Bench / README.md

ngoan

MedASR Bench: full platform + Hugging Face Docker Space packaging

d70f132 12 days ago

1.35 kB

	---
	title: MedASR Bench
	emoji: 🩺
	colorFrom: indigo
	colorTo: purple
	sdk: docker
	app_port: 3000
	pinned: false
	license: mit
	short_description: Vietnamese medical ASR benchmark, code-switching aware
	---

	# MedASR Bench

	A public benchmark for **Vietnamese medical speech recognition with English
	code-switching**, anchored on the [ViMedCSS](https://huggingface.co/datasets/tensorxt/ViMedCSS)
	dataset. Part of the Clinical Scribe program.

	This Space is the display layer only — a stateless Next.js app that renders
	the leaderboard, per-model pages, and methodology. It runs on free CPU; no audio
	and no model inference ever happen here. Benchmark numbers are produced by the
	offline harness (`benchmark/medasr_eval.py`) and published as versioned,
	auditable run manifests.

	## Metrics

	WER · CER · CS-WER (code-switched-term WER) · N-WER · MTR (medical-term
	recall), each with bootstrap 95% CIs under a pinned text normalizer.

	## Running locally

	```bash
	npm install
	npm run dev # http://localhost:3000
	```

	## Build / deploy

	This Space is a Docker SDK Space. The image builds the Next.js
	[`standalone`](https://nextjs.org/docs/app/api-reference/config/next-config-js/output)
	output and serves it with `node server.js` as uid 1000 on port 3000.

	```bash
	docker build -t medasr-bench .
	docker run --rm -p 3000:3000 medasr-bench
	```