Token Classification
Transformers
Safetensors
PyTorch
Hindi
bert
ner
pii
pii-detection
de-identification
privacy
healthcare
medical
clinical
phi
hindi
openmed
Eval Results (legacy)
Instructions to use OpenMed/OpenMed-PII-Hindi-BioClinicalBERT-Base-110M-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenMed/OpenMed-PII-Hindi-BioClinicalBERT-Base-110M-v1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="OpenMed/OpenMed-PII-Hindi-BioClinicalBERT-Base-110M-v1")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("OpenMed/OpenMed-PII-Hindi-BioClinicalBERT-Base-110M-v1") model = AutoModelForTokenClassification.from_pretrained("OpenMed/OpenMed-PII-Hindi-BioClinicalBERT-Base-110M-v1") - Notebooks
- Google Colab
- Kaggle
| Classification Report for Hindi PII Detection | |
| Model: emilyalsentzer/Bio_ClinicalBERT | |
| ============================================================ | |
| precision recall f1-score support | |
| AGE 0.96 0.99 0.98 193 | |
| BUILDINGNUMBER 0.99 0.97 0.98 235 | |
| CITY 0.90 0.94 0.92 373 | |
| CREDITCARD 0.97 1.00 0.98 56 | |
| DATE 0.99 1.00 0.99 273 | |
| EMAIL 1.00 1.00 1.00 286 | |
| FIRSTNAME 0.90 0.94 0.92 2681 | |
| GENDER 0.99 0.99 0.99 147 | |
| LASTNAME 0.83 0.75 0.79 929 | |
| MASKEDNUMBER 0.94 0.64 0.77 160 | |
| PHONE 0.98 0.99 0.98 447 | |
| PREFIX 0.92 0.87 0.89 165 | |
| SSN 0.84 0.96 0.90 361 | |
| STREET 0.96 0.97 0.96 266 | |
| TIME 1.00 1.00 1.00 905 | |
| ZIPCODE 1.00 0.99 1.00 110 | |
| micro avg 0.93 0.93 0.93 7587 | |
| macro avg 0.95 0.94 0.94 7587 | |
| weighted avg 0.93 0.93 0.93 7587 | |