Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
caiovicentino1
/
Qwen3.5-9B-Neo-HLWQ-Q5
like
5
Text Generation
qwen3_5_text
hlwq
qwen3.5
quantized
kv-cache-compression
conversational
polarengine
arxiv:
2502.02617
arxiv:
2603.29078
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
Qwen3.5-9B-Neo-HLWQ-Q5
6.27 GB
Ctrl+K
Ctrl+K
1 contributor
History:
15 commits
caiovicentino1
Remove legacy polar_config.json
3355841
verified
2 months ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
3 months ago
README.md
4.13 kB
HLWQ rebrand: title, tags, notice, self-links
2 months ago
chat_template.jinja
4.05 kB
Upload folder using huggingface_hub
3 months ago
compression.png
58.6 kB
Upload compression.png with huggingface_hub
3 months ago
config.json
2.04 kB
fix: quant_method polar -> polarengine for vLLM compatibility
3 months ago
family.png
44.8 kB
Upload family.png with huggingface_hub
3 months ago
hlwq_config.json
261 Bytes
Add hlwq_config.json (rename from polar_config.json)
2 months ago
kv_speed.png
35.5 kB
Upload kv_speed.png with huggingface_hub
3 months ago
model_int4.pt
6.25 GB
xet
Upload model_int4.pt with huggingface_hub
3 months ago
tokenizer.json
20 MB
xet
Upload folder using huggingface_hub
3 months ago
tokenizer_config.json
1.17 kB
Upload folder using huggingface_hub
3 months ago