---
base_model:
- prithivMLmods/gemma-4-12B-it-heretic_decensored
tags:
- text-generation-inference
- llama-cpp
- decensored
- abliterated
- unfiltered
- unredacted
- heretic
license: apache-2.0
language:
- en
pipeline_tag: image-text-to-text
library_name: transformers
---

# **gemma-4-12B-it-heretic_decensored-GGUF**

> **gemma-4-12B-it-heretic_decensored** is a reasoning-capable language model built on top of **google/gemma-4-12B-it** and modified using the **Heretic** abliteration toolkit. The model applies refusal-direction analysis and targeted weight-space interventions to reduce internal refusal behaviors while preserving instruction-following, reasoning capabilities, and general conversational performance.

> [!IMPORTANT]
> This model is intended strictly for research and learning purposes. Due to reduced internal refusal mechanisms, it may generate sensitive or unrestricted content. Users assume full responsibility for how the model is used. The authors and hosting platform disclaim any liability for generated outputs.

> [!NOTE]
> This model is experimental and may generate unexpected behaviors or artifacts in certain scenarios.

## Model Files

   File Name | Quant Type | File Size | File Link |
 |-----------|------------|-----------|-----------|
 | gemma-4-12B-it-heretic_decensored.BF16.gguf | BF16 | 23.8 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.BF16.gguf) |
 | gemma-4-12B-it-heretic_decensored.F16.gguf | F16 | 23.8 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.F16.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q2_K.gguf | Q2_K | 4.83 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q2_K.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q3_K_L.gguf | Q3_K_L | 6.57 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q3_K_L.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q3_K_M.gguf | Q3_K_M | 6.09 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q3_K_M.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q3_K_S.gguf | Q3_K_S | 5.53 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q3_K_S.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q4_0.gguf | Q4_0 | 6.98 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q4_0.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q4_K_M.gguf | Q4_K_M | 7.38 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q4_K_M.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q4_K_S.gguf | Q4_K_S | 7.02 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q4_K_S.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q5_0.gguf | Q5_0 | 8.34 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q5_0.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q5_K_M.gguf | Q5_K_M | 8.55 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q5_K_M.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q5_K_S.gguf | Q5_K_S | 8.34 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q5_K_S.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q6_K.gguf | Q6_K | 9.79 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q6_K.gguf) |
 | gemma-4-12B-it-heretic_decensored.Q8_0.gguf | Q8_0 | 12.7 GB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.Q8_0.gguf) |
 | gemma-4-12B-it-heretic_decensored.mmproj-bf16.gguf | mmproj-bf16 | 175 MB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.mmproj-bf16.gguf) |
 | gemma-4-12B-it-heretic_decensored.mmproj-f16.gguf | mmproj-f16 | 175 MB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.mmproj-f16.gguf) |
 | gemma-4-12B-it-heretic_decensored.mmproj-q8_0.gguf | mmproj-q8_0 | 159 MB | [Download](https://huggingface.co/prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF/blob/main/gemma-4-12B-it-heretic_decensored.mmproj-q8_0.gguf) |

## Quick Start with llama.cpp (Docker)

```dockerfile
FROM ghcr.io/ggml-org/llama.cpp:full

WORKDIR /app

RUN apt update && apt install -y python3-pip
RUN pip install -U huggingface_hub --break-system-packages

RUN python3 -c 'from huggingface_hub import hf_hub_download; \
    repo="prithivMLmods/gemma-4-12B-it-heretic_decensored-GGUF"; \
    hf_hub_download(repo_id=repo, filename="gemma-4-12B-it-heretic_decensored.Q8_0.gguf", local_dir="/app"); \
    hf_hub_download(repo_id=repo, filename="gemma-4-12B-it-heretic_decensored.mmproj-f16.gguf", local_dir="/app")'

CMD ["--server", \
     "-m", "/app/gemma-4-12B-it-heretic_decensored.Q8_0.gguf", \
     "--mmproj", "/app/gemma-4-12B-it-heretic_decensored.mmproj-f16.gguf", \
     "--host", "0.0.0.0", \
     "--port", "7860", \
     "-t", "2", \
     "--cache-type-k", "q8_0", \
     "--cache-type-v", "iq4_nl", \
     "-c", "128000", \
     "-n", "38912"]
```

## llama.cpp

LLM inference in C/C++ — https://github.com/ggml-org/llama.cpp