DeepSeek-V2-Lite-NVFP4 / modeling_deepseek.py

Commit History

Add DeepSeek-V2-Lite quantized to NVFP4 via llm-compressor
d3be582
verified

carlyou committed on