rungalileo/llama-3.2-3b-instruct-trtllm-ckpt-wq_nvfp4-kv_fp8

Text Generation

kv-cache-quantization

3.47 GB

1 contributor

History: 2 commits

Upload folder using huggingface_hub

83a16a6 verified 3 months ago