How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf kalle07/GLM-Z1-9B-0414-heretic_R9_KL0015-gguf:Q4_K_M
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default kalle07/GLM-Z1-9B-0414-heretic_R9_KL0015-gguf:Q4_K_M
Run Hermes
hermes
Quick Links

This is a really uncensored version of zai-org/GLM-Z1-9B-0414 created with Heretic
https://github.com/p-e-w/heretic

initial Refusals 58/100
-> now 9 Refusals with KL=0.015
18 sounds like a lot, but I added 30 more refusal_markers, so there may be fewer without them.
I also added 17 more bad prompts to the dataset.
Since this is a reasoning/thinking model, I suggest that you also adapt the system prompt to your needs.

Note: This heretic model is highly uncensored; thus use it with extreme caution and care.



Downloads last month
9
GGUF
Model size
9B params
Architecture
glm4
Hardware compatibility
Log In to add your hardware

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kalle07/GLM-Z1-9B-0414-heretic_R9_KL0015-gguf

Quantized
(24)
this model

Collection including kalle07/GLM-Z1-9B-0414-heretic_R9_KL0015-gguf