Beambutbetter
/

Deepseek-V2-Lite-16B-NVFP4

Text Generation

8-bit precision

Model card Files Files and versions

Deepseek-V2-Lite-16B-NVFP4 / hf_quant_config.json

Beambutbetter's picture

Upload folder using huggingface_hub

d1be4a0 verified 9 months ago

History Blame Contribute Delete

287 Bytes

	{
	"producer": {
	"name": "modelopt",
	"version": "0.39.0.dev12+g9e64f81a6"
	},
	"quantization": {
	"quant_algo": "NVFP4",
	"kv_cache_quant_algo": "NVFP4",
	"group_size": 16,
	"exclude_modules": [
	"lm_head"
	]
	}
	}