---
language:
- en
license: apache-2.0
tags:
- roleplay
- sillytavern
- idol
- google
- pytorch
- DarkIdol
- gemma
- gemma4
library_name: transformers
pipeline_tag: image-text-to-text
base_model: aifeifei798/Gemma-4-31B-Cognitive-Unshackled
---

- Original model: [`aifeifei798/Gemma-4-31B-Cognitive-Unshackled`](https://huggingface.co/aifeifei798/Gemma-4-31B-Cognitive-Unshackled). Refer to it for more details on the model.
- This is a backup quant, inferior to `mradermacher/Gemma-4-31B-Cognitive-Unshackled-i1-GGUF`. I recommend `Gemma-4-31B-Cognitive-Unshackled.i1-IQ4_XS.gguf` as a replacement.
---

Okay, so I ran this in llama.cpp with SillyTavern in chat completion mode.
On my 24 GB of VRAM this fits 32000 context with an f16 KV cache and a batch size of 1024.
It works fine: coherent over long RP contexts, with thinking both on and off. No breakage, no issues.
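
For anyone who wants to try roughly the same setup, here is a minimal sketch that builds a `llama-server` command line with the settings mentioned above (32000 context, f16 KV cache, batch 1024). The model path is a placeholder, the helper name `build_llama_server_cmd` is made up for illustration, and `gpu_layers=99` (offload everything to the GPU) is my assumption rather than something stated here, so adjust it to your hardware.

```python
import shlex


def build_llama_server_cmd(model_path, ctx_size=32000, batch_size=1024,
                           kv_type="f16", gpu_layers=99):
    """Return an argv list for llama.cpp's llama-server.

    ctx_size, batch_size and kv_type mirror the settings reported above.
    gpu_layers=99 is an assumption (full GPU offload), not a reported value.
    """
    return [
        "llama-server",
        "-m", model_path,                  # placeholder path to the .gguf file
        "--ctx-size", str(ctx_size),       # context window in tokens
        "--batch-size", str(batch_size),   # logical batch size
        "--cache-type-k", kv_type,         # KV cache type for keys
        "--cache-type-v", kv_type,         # KV cache type for values
        "--n-gpu-layers", str(gpu_layers), # assumed: offload all layers
    ]


# Print a shell-ready command; replace the path with your downloaded file.
cmd = build_llama_server_cmd("path/to/model.gguf")
print(shlex.join(cmd))
```

Once the server is running, SillyTavern's chat completion mode can be pointed at its OpenAI-compatible endpoint. If VRAM is tight, lowering `ctx_size` is the first thing to try.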

## Example Dialogue

![Example dialogue](https://huggingface.co/s1arsky/Gemma-4-31B-Cognitive-Unshackled-Q4_KS_GGUF/resolve/main/example_dialogue.png)