DeepSeek-V2-Lite-FP8-Group / configuration_deepseek.py

Commit History

Add DeepSeek-V2-Lite quantized to per-group FP8 (group_size=64) via llm-compressor
68c5c3a

carlyou committed on