DeepSeek-V2-Lite-FP8-Group / generation_config.json
carlyou's picture
Add DeepSeek-V2-Lite quantized to per-group FP8 (group_size=64) via llm-compressor
68c5c3a verified
Raw
History Blame
181 Bytes
{
"_from_model_config": true,
"bos_token_id": 100000,
"do_sample": true,
"eos_token_id": 100001,
"temperature": 0.3,
"top_p": 0.95,
"transformers_version": "4.57.6"
}