Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
LeaderboardModel1
/
Qwen2-0.2B-pt-AutoRound-W4A16-RTN
like
0
Follow
Leaderboard Optimized Model 1
3
Text Generation
Safetensors
qwen2
quantized
w4a16
autoround
low-bit-open-llm-leaderboard
conversational
4-bit precision
auto-round
arxiv:
2309.05516
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
Qwen2-0.2B-pt-AutoRound-W4A16-RTN
336 MB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
INC4AI
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
5f5803c
verified
7 days ago
.gitattributes
Safe
1.57 kB
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
README.md
6.09 kB
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
chat_template.jinja
Safe
201 Bytes
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
config.json
1.28 kB
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
generation_config.json
226 Bytes
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
model.safetensors
324 MB
xet
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
quantization_config.json
Safe
302 Bytes
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago
tokenizer_config.json
888 Bytes
Upload quantized model Qwen2-0.2B-pt-AutoRound-W4A16-RTN
7 days ago