LeaderboardModel1/neo-3-1B-A90M-Base-AutoRound-W4A16-RTN
Text Generation
Safetensors
mixtral
quantized
w4a16
autoround
low-bit-open-llm-leaderboard
4-bit precision
auto-round
arxiv:2309.05516
Branch: main, 739 MB total
1 contributor. History: 2 commits
Latest commit: INC4AI, "Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN" (e1485f6, verified), 5 days ago
File | Size | Flags | Last commit | Updated
.gitattributes | 1.52 kB | Safe | initial commit | 5 days ago
README.md | 6.13 kB | | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
config.json | 1.5 kB | | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
generation_config.json | 132 Bytes | | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
model.safetensors | 735 MB | xet | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
quantization_config.json | 428 Bytes | | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
tokenizer.json | 3.52 MB | Safe | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago
tokenizer_config.json | 339 Bytes | | Upload quantized model neo-3-1B-A90M-Base-AutoRound-W4A16-RTN | 5 days ago