---
language:
- uz
- en
license: cc-by-nc-4.0
datasets:
- yakhyo/uz-wiki
- tahrirchi/uz-books-v2
- tahrirchi/uz-crawl
- saillab/alpaca_uzbek_taco
- behbudiy/alpaca-cleaned-uz
- UAzimov/uzbek-instruct-llm
- CohereLabs/aya_collection_language_split
- med-alex/qa_mt_ru_to_uzn
- med-alex/qa_mt_tr_to_uzn
library_name: gguf
pipeline_tag: text-generation
base_model: inspirebek/qwen3-4b-uzbek-v2
tags:
- uzbek
- qwen3
- quantized
- gguf
- llama.cpp
- ollama
---

# qwen3-4b-uzbek-v2-gguf

gguf suite for [`inspirebek/qwen3-4b-uzbek-v2`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2). cpu / apple silicon / vulkan / rocm via `llama.cpp`, ollama, lm studio, etc.

## files

| quant | size | notes |
|---|---|---|
| `f16` | 8.8 gb | reference fp16 |
| `Q8_0` | 4.7 gb | near-lossless |
| `Q6_K` | 3.6 gb | recommended for quality |
| `Q5_K_M` | 3.2 gb | balanced |
| `Q5_K_S` | 3.1 gb | slightly lighter |
| `Q4_K_M` | 2.7 gb | **recommended for most users** |
| `Q4_K_S` | 2.6 gb | smaller, slight quality loss |
| `Q3_K_M` | 2.2 gb | aggressive |
| `Q2_K` | 1.8 gb | edge / low-ram only |

## usage

**llama.cpp:**

```bash
llama-cli -m qwen3-4b-uzbek-v2-q4_k_m.gguf -p "Salom! Qalaysan?" -cnv
```

**ollama:**

```bash
ollama run hf.co/inspirebek/qwen3-4b-uzbek-v2-GGUF:Q4_K_M
```

## quantization

converted from the bf16 merged model via `llama.cpp`'s `convert_hf_to_gguf.py` → `llama-quantize`. no calibration data (k-quants are statistics-only).

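the two-step pipeline described above can be sketched as a short script. this is a minimal sketch under stated assumptions, not the exact commands used for this repo: the local `MODEL_DIR` path, the `DRY_RUN` switch, the `run` and `quant_filename` helpers, the intermediate f16 gguf, and the lowercase output naming (inferred from the file name in the usage example) are all hypothetical. by default it only prints the commands it would run.

```bash
# sketch: hf checkpoint -> f16 gguf -> k-quants (no importance matrix)
set -eu

MODEL_DIR="${MODEL_DIR:-./qwen3-4b-uzbek-v2}"  # assumed local copy of the merged model
BASE="qwen3-4b-uzbek-v2"                       # assumed output file prefix
DRY_RUN="${DRY_RUN:-1}"                        # 1 = print commands, 0 = execute them
QUANTS="Q8_0 Q6_K Q5_K_M Q5_K_S Q4_K_M Q4_K_S Q3_K_M Q2_K"

# print the command in dry-run mode, otherwise execute it
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# map a quant type to an output file name, e.g. Q4_K_M -> <base>-q4_k_m.gguf
quant_filename() {
  printf '%s-%s.gguf\n' "$BASE" "$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
}

# step 1: convert the hugging face checkpoint to a single f16 gguf
run python convert_hf_to_gguf.py "$MODEL_DIR" --outtype f16 --outfile "${BASE}-f16.gguf"

# step 2: quantize the f16 file to each target type
# (no --imatrix flag is passed, so no calibration data is involved)
for q in $QUANTS; do
  run llama-quantize "${BASE}-f16.gguf" "$(quant_filename "$q")" "$q"
done
```

to actually execute the commands, run the script with `DRY_RUN=0` from a `llama.cpp` checkout (where `convert_hf_to_gguf.py` lives) with `llama-quantize` on `PATH`.
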
## datasets

**stage a — fluency (continued pretraining):**

- [`yakhyo/uz-wiki`](https://huggingface.co/datasets/yakhyo/uz-wiki) · MIT
- [`tahrirchi/uz-books-v2`](https://huggingface.co/datasets/tahrirchi/uz-books-v2) · MIT
- [`tahrirchi/uz-crawl`](https://huggingface.co/datasets/tahrirchi/uz-crawl) · Apache-2.0

**stage b — instruct (sft):**

- [`saillab/alpaca_uzbek_taco`](https://huggingface.co/datasets/saillab/alpaca_uzbek_taco) · CC-BY-NC-4.0
- [`behbudiy/alpaca-cleaned-uz`](https://huggingface.co/datasets/behbudiy/alpaca-cleaned-uz) · CC-BY-4.0
- [`UAzimov/uzbek-instruct-llm`](https://huggingface.co/datasets/UAzimov/uzbek-instruct-llm) · Apache-2.0
- [`CohereLabs/aya_collection_language_split`](https://huggingface.co/datasets/CohereLabs/aya_collection_language_split) · Apache-2.0
- [`med-alex/qa_mt_ru_to_uzn`](https://huggingface.co/datasets/med-alex/qa_mt_ru_to_uzn) · unspecified
- [`med-alex/qa_mt_tr_to_uzn`](https://huggingface.co/datasets/med-alex/qa_mt_tr_to_uzn) · unspecified

> ⚠️ licensing note: `saillab/alpaca_uzbek_taco` is cc-by-nc-4.0, which restricts commercial use of derivative models. downstream users who need a fully permissive license should retrain without that subset.

## sibling formats

- [`inspirebek/qwen3-4b-uzbek-v2`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2)
- [`inspirebek/qwen3-4b-uzbek-v2-lora`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-lora)
- [`inspirebek/qwen3-4b-uzbek-v2-bnb-4bit`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-bnb-4bit)
- [`inspirebek/qwen3-4b-uzbek-v2-awq`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-awq)
- [`inspirebek/qwen3-4b-uzbek-v2-GGUF`](https://huggingface.co/inspirebek/qwen3-4b-uzbek-v2-GGUF)