rungalileo/llama-3.2-3b-instruct-trtllm-ckpt-wq_nvfp4-kv_fp8

Text Generation

kv-cache-quantization



Commit History

Upload folder using huggingface_hub

83a16a6
verified

rungalileo committed on Mar 16

initial commit

6e9f9be
verified

rungalileo committed on Mar 16