Paijo committed on
Commit 8002fb5 · verified · 1 Parent(s): 9f145a9

Add README.md

Files changed (1)
  1. README.md +58 -0
README.md ADDED
@@ -0,0 +1,58 @@
+ ---
+ license: apache-2.0
+ base_model: Qwen/Qwen3-4B
+ tags:
+ - abliterated
+ - uncensored
+ - qwen3
+ - ollama
+ ---
+
+ # Qwen3-4B Abliterated (Uncensored)
+
+ **Base model**: Qwen/Qwen3-4B
+ **Method**: Heretic abliteration (refusal direction removal)
+ **Result**: 9/10 refusals removed, KL divergence: 0.0388
+
+ ## What is abliteration?
+
+ Abliteration edits the model weights directly to remove the "refusal direction"; it is not prompt engineering. Because the change is made at the weight level, it applies regardless of the system prompt. In the evaluation reported here it removed 9 of 10 refusals, so an occasional refusal can still occur.
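
The weight-level edit described above can be sketched as an orthogonal projection. This is only a minimal illustration, assuming a single refusal direction `r` and a weight matrix `W` whose outputs live in the same space as `r`. The names `ablate_direction`, `W`, and `r` and the toy numbers are hypothetical, and this is not Heretic's actual implementation, which this README does not show.

```python
import math


def normalize(v):
    """Return v scaled to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [dot(row, x) for row in W]


def ablate_direction(W, r):
    """Return W' = W - r (r^T W), so outputs of W' have no component along r.

    W is a list of rows (d_out x d_in); r is a direction of length d_out.
    """
    r = normalize(r)
    d_out, d_in = len(W), len(W[0])
    # r^T W: how strongly each input column writes along r
    coeffs = [sum(r[i] * W[i][j] for i in range(d_out)) for j in range(d_in)]
    return [[W[i][j] - r[i] * coeffs[j] for j in range(d_in)] for i in range(d_out)]


# Toy example with made-up numbers (not taken from any real model)
W = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
r = [0.0, 1.0, 0.0]  # hypothetical "refusal direction"

W_ablated = ablate_direction(W, r)
y = matvec(W_ablated, [1.0, -1.0])
print("component along r after ablation:", dot(normalize(r), y))
```

After the projection, no input can produce an output with a component along `r`, which is why the edit does not depend on the prompt.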
+
+ ## Performance
+
+ | Metric | Value |
+ |--------|-------|
+ | Refusals removed | 9/10 (90%) |
+ | KL divergence | 0.0388 |
+ | Base model quality | Preserved |
+
+ A KL divergence below 0.05 is taken here to indicate minimal capability damage.
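
For reference, the KL divergence between two discrete distributions P and Q is the sum over i of p_i * log(p_i / q_i). The sketch below assumes P is the base model's next-token distribution and Q is the abliterated model's. The README does not state the direction or the prompt set used, so that is an assumption, and the probabilities are made up for illustration.

```python
import math


def kl_divergence(p, q):
    """D_KL(P || Q) in nats for two discrete distributions over the same support.

    Assumes every q_i is positive wherever p_i is positive.
    """
    if len(p) != len(q):
        raise ValueError("p and q must have the same length")
    # Terms with p_i == 0 contribute nothing to the sum
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Made-up next-token probabilities over a three-token vocabulary
base = [0.70, 0.20, 0.10]     # hypothetical base model
ablated = [0.65, 0.23, 0.12]  # hypothetical abliterated model

kl = kl_divergence(base, ablated)
print(f"KL = {kl:.4f}")
print("below the 0.05 threshold:", kl < 0.05)
```

A value of 0 means the two distributions are identical; larger values mean the edited model's predictions have drifted further from the base model's.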
+
+ ## Usage
+
+ ### Ollama
+ ```bash
+ ollama run hf.co/paijo77/qwen3-4b-abliterated
+ ```
+
+ ### Transformers
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+ import torch
+
+ model = AutoModelForCausalLM.from_pretrained(
+     "paijo77/qwen3-4b-abliterated",
+     torch_dtype=torch.float16,
+     device_map="auto"
+ )
+ tokenizer = AutoTokenizer.from_pretrained("paijo77/qwen3-4b-abliterated")
+ ```
+
+ ## Support
+
+ If this model is useful:
+ 👉 **https://www.tip.md/oyi77**
+
+ ## License
+
+ Apache 2.0