rungalileo/llama-3.2-3B-instruct-trtllm-ckpt-wq_int4_awq-kv_int8

Text Generation

kv-cache-quantization

3.07 GB

1 contributor

History: 6 commits

Upload README.md with huggingface_hub

aa085e6 verified 4 months ago