What about uncensored/abliterated version?

#2
by tima2431 - opened

Hi! Just wanted to say this Cerebellum v6 is a very cool model, works so well. I was wondering if you plan to do an abliterated/uncensored version or something? I really liked how smart it is, just want it without all the censorship and refusals. Keep up the work!

Thank you! I'm glad you like it! I'm actually working on this. I'm just not trying to follow the known ways, at least not without stumbling into them. So it may take me longer, but I'm hoping to avoid the usual abliterated/uncensored sanity drop-off if possible.

If you're looking for a base model with little drop off, I'd recommend taking a look at coder3101's stuff.
https://huggingface.co/coder3101/gemma-4-26B-A4B-it-heretic

Uploaded a separate Heretic/Cerebellum GGUF repo here:

https://huggingface.co/deucebucket/Gemma-4-26B-A4B-it-Heretic-Cerebellum-GGUF

This uses coder3101/gemma-4-26B-A4B-it-heretic as the source checkpoint and applies the Gemma 4 26B Cerebellum tensor recipe. I kept it separate from the regular Cerebellum repo and included the mmproj file plus current benchmark JSONs in the repo.

Current local results are listed on the model card: ARC-Challenge 95.48%, HellaSwag 83.49%, MMLU Redux 71.42%, vision smoke 6/6, and the project refusal harness measured 1/45 refused.

i just started testing it and so far it's just amazing, for my tasks and dialogues it works just fine! you have very cool models! <3

Yeah, I've been using it since, thanks for the suggestion! It's now my new daily driver! Gemma 4 certainly has a lot of personality and knowledge packed in.

Yeah, it's my daily driver now too! Honestly, the quality is insane, for my tasks it feels almost on par with Gemini 2.5 Pro, but completely uncensored, which is exactly what I needed. Hitting 35-45 t/s on an RTX 5060 is pure gold. Keep it going! Also, one quick question since I couldn't find any info on this anywhere: when running this (and other models based on Gemma 4) through standard llama.cpp at high context lengths (like 25k+ tokens), the model sometimes completely stops using its reasoning/chain-of-thought phase. It just prints 'enough;' or a similar word and skips straight to the answer, even if I force reasoning parameters at launch. Have you noticed this context degradation too, or do you happen to know a fix/sampler tweak for it?

Yeah, I've seen this, and it was kind of funny. I was using it in open code, and it stopped working. I asked if it was complete, it worked for another 3 minutes, then said "no" and stopped again. Made me laugh at the long think and abrupt answer. Currently I'm kicking around Qwen 3.5 9B to find out on a small scale if that's something I can actually improve. This might also have something to do with the chat token template, which I'm still working on getting updated too. There are also branch versions of llama.cpp that seemingly fix the thinking loops, but that just hasn't made it to main yet, as far as I've seen.

You're welcome for the suggestion, thanks for giving it your crunching process. 😄

I like Gemma 4 26B-A4B, but without your GGUFs I have to close every single open process on my PC.

I've been testing this one extensively, and so far it feels far more capable than your other v6. I haven't seen a single wrong token... yet.

I did also notice Heretic performed better on all the tests I put it through, so that's definitely worth noting. No clue what in the breakdown also improved accuracy.
