--- language: - en license: apache-2.0 tags: - roleplay - sillytavern - idol - google - pytorch - DarkIdol - gemma - gemma4 library_name: transformers pipeline_tag: image-text-to-text base_model: aifeifei798/Gemma-4-31B-Cognitive-Unshackled --- - Original model: [`aifeifei798/Gemma-4-31B-Cognitive-Unshackled`](https://huggingface.co/aifeifei798/Gemma-4-31B-Cognitive-Unshackled) - refer for more details on the model. - This is a backup quant inferior to mradermacher/Gemma-4-31B-Cognitive-Unshackled-i1-GGUF. I recommend: Gemma-4-31B-Cognitive-Unshackled.i1-IQ4_XS.gguf as a replacement. --- Okay. So I ran this in llama.cpp and Silly Tavern chat completion. On my 24VRAM this fits 32000 context at f16 kv cache and 1024 batch. Works fine, long RP context coherence, thinking and no thinking. No breaking, no issues. ## Example Dialogue ![Example dialogue](https://huggingface.co/s1arsky/Gemma-4-31B-Cognitive-Unshackled-Q4_KS_GGUF/resolve/main/example_dialogue.png)