Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -20,6 +20,7 @@ You might need to build from source as it is not included in the nightly build y
|
|
| 20 |
Alternatively, patch the latest nightly image yourself to include that PR.
|
| 21 |
|
| 22 |
## Changelog
|
|
|
|
| 23 |
- **25/02/2026**: Added missing processor configs (`preprocessor_config.json`, `video_preprocessor_config.json`, `processor_config.json`), `vocab.json`, and restored full `tokenizer_config.json` from the base model. Fixes vision/video input support and tokenizer loading issues.
|
| 24 |
- **22/02/2026**: Re-quantized with improved calibration data and parameters. Fixed 14 Inf `input_global_scale` values caused by rarely-activated experts receiving all-zero activations during calibration. All 92,400 scale tensors now valid. Fixed `tokenizer_class` and added `mlp.gate` to quantization ignore list in `config.json`.
|
| 25 |
- **20/02/2026**: Reuploaded weights with some issues fixed.
|
|
|
|
| 20 |
Alternatively, patch the latest nightly image yourself to include that PR.
|
| 21 |
|
| 22 |
## Changelog
|
| 23 |
+
- **02/03/2026**: Added MTP (multi-token prediction) weights from source checkpoint, enabling speculative decoding with vLLM.
|
| 24 |
- **25/02/2026**: Added missing processor configs (`preprocessor_config.json`, `video_preprocessor_config.json`, `processor_config.json`), `vocab.json`, and restored full `tokenizer_config.json` from the base model. Fixes vision/video input support and tokenizer loading issues.
|
| 25 |
- **22/02/2026**: Re-quantized with improved calibration data and parameters. Fixed 14 Inf `input_global_scale` values caused by rarely-activated experts receiving all-zero activations during calibration. All 92,400 scale tensors now valid. Fixed `tokenizer_class` and added `mlp.gate` to quantization ignore list in `config.json`.
|
| 26 |
- **20/02/2026**: Reuploaded weights with some issues fixed.
|