DeepSeek-V2-Lite-NVFP4 / recipe.yaml
Add DeepSeek-V2-Lite quantized to NVFP4 via llm-compressor
d3be582 verified
default_stage:
  default_modifiers:
    QuantizationModifier:
      targets: [Linear]
      ignore: [lm_head]
      scheme: NVFP4
      bypass_divisibility_checks: false