llm-semantic-router/modernbert-base-32k-haldetect


HuaminChen committed on Jan 9

Commit 5ffa6da · verified · Parent: 6097a2d

Upload README.md with huggingface_hub

Files changed (1)

README.md +5 -1

@@ -11,6 +11,9 @@ tags:
 - token-classification
 - long-context
 - 32k
+- amd
+- rocm
+- mi300x
 datasets:
 - RAGTruth
 - llm-semantic-router/longcontext-haldetect
@@ -158,8 +161,9 @@ early_stopping_patience: 3
 ```
 ### Hardware
-- AMD MI300X GPU (196GB VRAM)
+- **AMD Instinct MI300X GPU** (192GB HBM3) - Trained entirely on AMD ROCm
 - Training time: ~20 minutes
+- Framework: PyTorch + HuggingFace Transformers on ROCm 6.0
 ## When to Use This Model