Upload README.md with huggingface_hub
README.md CHANGED
@@ -11,6 +11,9 @@ tags:
 - token-classification
 - long-context
 - 32k
+- amd
+- rocm
+- mi300x
 datasets:
 - RAGTruth
 - llm-semantic-router/longcontext-haldetect
@@ -158,8 +161,9 @@ early_stopping_patience: 3
 ```
 
 ### Hardware
-- AMD MI300X GPU (
+- **AMD Instinct MI300X GPU** (192GB HBM3) - Trained entirely on AMD ROCm
 - Training time: ~20 minutes
+- Framework: PyTorch + HuggingFace Transformers on ROCm 6.0
 
 ## When to Use This Model
 