FUMEA-F Dense GGUF — Quantized Versions

GGUF quantizations of uaytug/fumea-f-dense — a DARE+TIES merge of 4 specialized Qwen3-8B financial/marketing models.

Architecture

Detail Value
Base Qwen3-8B
Merge Method DARE+TIES (density 0.75)
Context 128K (YaRN RoPE)
Tool Use Supported

Quantizations

File Quant Size Use Case
fumea-f-dense-F32.gguf F32 30.52 GB Research, maximum quality
fumea-f-dense-F16.gguf F16 15.26 GB GPU inference, full precision
fumea-f-dense-BF16.gguf BF16 15.26 GB GPU inference, bfloat16
fumea-f-dense-Q8_0.gguf Q8_0 8.11 GB Near-lossless, high-end GPU/CPU
fumea-f-dense-Q6_K.gguf Q6_K 6.26 GB Excellent quality, high-end hardware
fumea-f-dense-Q5_K_M.gguf Q5_K_M 5.45 GB Very good quality, balanced
fumea-f-dense-Q5_K_S.gguf Q5_K_S 5.33 GB Very good quality, smaller
fumea-f-dense-Q5_0.gguf Q5_0 5.33 GB Good quality, moderate hardware
fumea-f-dense-Q4_K_M.gguf Q4_K_M 4.68 GB ⭐ Recommended — best quality/size
fumea-f-dense-Q4_K_S.gguf Q4_K_S 4.47 GB Good quality, smaller footprint
fumea-f-dense-Q4_0.gguf Q4_0 4.45 GB Decent quality, broad compatibility
fumea-f-dense-Q3_K_L.gguf Q3_K_L 4.13 GB Acceptable quality, lower memory
fumea-f-dense-Q3_K_M.gguf Q3_K_M 3.84 GB Lower quality, memory constrained
fumea-f-dense-Q3_K_S.gguf Q3_K_S 3.51 GB Lower quality, very constrained
fumea-f-dense-Q2_K.gguf Q2_K 3.06 GB Minimal quality, extreme compression
fumea-f-dense-IQ4_XS.gguf IQ4_XS 4.28 GB imatrix 4-bit extra small
fumea-f-dense-IQ4_NL.gguf IQ4_NL 4.49 GB Non-linear 4-bit
fumea-f-dense-IQ3_M.gguf IQ3_M 3.63 GB imatrix 3-bit medium
fumea-f-dense-IQ3_S.gguf IQ3_S 3.53 GB imatrix 3-bit small
fumea-f-dense-IQ3_XXS.gguf IQ3_XXS 0.65 GB imatrix 3-bit XXS
fumea-f-dense-IQ2_M.gguf IQ2_M 0.65 GB Extreme 2-bit medium

Recommended: Q4_K_M for the best quality/size balance.

Ollama

wget https://huggingface.co/uaytug/fumea-f-dense-gguf/resolve/main/fumea-f-dense-Q4_K_M.gguf

cat > Modelfile << 'EOF'
FROM ./fumea-f-dense-Q4_K_M.gguf
PARAMETER temperature 0.6
PARAMETER top_p 0.9
PARAMETER repeat_penalty 1.1
PARAMETER num_ctx 32768
SYSTEM You are FUMEA-F, a financial analysis and marketing intelligence assistant.
EOF

ollama create fumea-f-dense -f Modelfile
ollama run fumea-f-dense

LM Studio

  1. Download any GGUF file from this repo
  2. LM Studio → My Models → Import
  3. Start chatting

See Also

License

Apache 2.0

Downloads last month
133
GGUF
Model size
8B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for uaytug/fumea-f-dense-gguf

Quantized
(1)
this model

Collection including uaytug/fumea-f-dense-gguf