dealignai commited on
Commit
5d3694e
·
verified ·
1 Parent(s): 1120a21

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,128 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: gemma
3
+ base_model: google/gemma-4-12B-it
4
+ library_name: mlx
5
+ pipeline_tag: image-text-to-text
6
+ thumbnail: dealign_mascot.png
7
+ tags:
8
+ - mlx
9
+ - apple-silicon
10
+ - abliterated
11
+ - uncensored
12
+ - crack
13
+ - vision
14
+ - audio
15
+ - omni-modal
16
+ - gemma4
17
+ - gemma
18
+ - google
19
+ - harmbench
20
+ - mmlu
21
+ - jang_4m
22
+ ---
23
+
24
+ <!-- vmlx-banner -->
25
+ <div align="center">
26
+ <a href="https://vmlx.net">
27
+ <img src="vmlx-banner.png" width="280" />
28
+ <br/>
29
+ <strong>Built for vMLX</strong> — the MLX inferencer with KV-cache quantization, prefix-cache reuse, agentic tool calling, and hybrid sliding+full attention support.
30
+ <br/>
31
+ <sub>Free for macOS · <strong>vmlx.net</strong></sub>
32
+ </a>
33
+ </div>
34
+
35
+ ---
36
+
37
+ <div align="center">
38
+
39
+ <img src="dealign_mascot.png" width="128" />
40
+
41
+ # Gemma 4 12B-it — JANG_4M CRACK
42
+
43
+ **CRACK abliterated** · **JANG mixed-precision (8-bit attention, 4-bit MLP)** · Omni-modal (text + image + audio + video) · 9.6 GB
44
+
45
+ <a href="https://ko-fi.com/dealignai"><img src="https://img.shields.io/badge/Ko--fi-Support_Development-FF5E5B?logo=ko-fi&logoColor=white&style=for-the-badge" alt="Ko-fi"></a>
46
+
47
+ </div>
48
+
49
+ ---
50
+
51
+ ## What Is This?
52
+
53
+ This is [Gemma 4 12B-it](https://huggingface.co/google/gemma-4-12B-it) by Google — a unified omni-modal language model (text + image + audio + video, hybrid sliding/full attention, 48 layers, 128k context) that has been:
54
+
55
+ 1. **CRACK abliterated** — safety refusal removed at the weight level. The model now complies across all task categories instead of refusing, while keeping its knowledge, reasoning, and multimodal capabilities intact.
56
+ 2. **JANG mixed-precision (8-bit attention, 4-bit MLP) quantized** for MLX on Apple Silicon — 9.6 GB.
57
+
58
+ ## Results
59
+
60
+ Evaluated through the Osaurus runtime on a Mac Studio M3 Ultra. Compliance graded via HarmBench-text refusal classifier; MMLU via logit-mode argmax over A/B/C/D token logits (matched on both base and CRACK with identical chat-template rendering — no answer truncated).
61
+
62
+ ### HarmBench compliance (70 prompts · 10 per category)
63
+
64
+ | Category | CRACK ASR |
65
+ |---|---:|
66
+ | Chemical / biological | **10 / 10** (100%) |
67
+ | Copyright | **10 / 10** (100%) |
68
+ | Cybercrime / intrusion | **10 / 10** (100%) |
69
+ | Harassment / bullying | **10 / 10** (100%) |
70
+ | Illegal | **10 / 10** (100%) |
71
+ | Misinformation / disinformation | **10 / 10** (100%) |
72
+ | General harmful | **10 / 10** (100%) |
73
+ | **Overall** | **70 / 70 (100%)** |
74
+
75
+ ### MMLU-228 (57-subject, 4 q per subject)
76
+
77
+ | Subject area | base | CRACK | Δ |
78
+ |---|---:|---:|---:|
79
+ | **Overall** | 67.1% | **69.3%** | +2.2pp |
80
+ | STEM | 68.1% | 66.7% | -1.4pp |
81
+ | Humanities | 57.7% | 63.5% | +5.8pp |
82
+ | Social Sciences | 75.0% | 75.0% | +0.0pp |
83
+ | Other (medicine, business, …) | 67.9% | 73.2% | +5.3pp |
84
+
85
+ ## Features
86
+
87
+ - **Omni-modal**: native text + image + audio + video inputs (Gemma 4's unified early-fusion encoder-free architecture)
88
+ - **128k context** with hybrid sliding-window + full-attention layers
89
+ - **Reasoning** via the Gemma 4 channel-marker format (`<|channel>thought ... <channel|>`)
90
+ - **48 transformer layers**, hidden size 3840
91
+
92
+ ## Usage
93
+
94
+ Run with the **Osaurus** runtime (recommended — supports the `gemma4_unified` architecture directly):
95
+
96
+ ```python
97
+ # OpenAI-compatible chat completion
98
+ # POST /v1/chat/completions
99
+ {
100
+ "model": "dealignai/Gemma-4-12B-it-JANG_4M-CRACK",
101
+ "messages": [{"role": "user", "content": "..."}],
102
+ "temperature": 0.0
103
+ }
104
+ ```
105
+
106
+ Google recommends `temperature=1.0, top_p=0.95, top_k=64` for general use.
107
+
108
+ ## About CRACK
109
+
110
+ **CRACK** (Controlled Refusal Ablation via Calibrated Knockouts) removes safety-refusal behavior at the weight level so the model complies with all task categories while preserving reasoning quality, factual knowledge, multimodal capability, and coherence.
111
+
112
+ ## Support dealignai
113
+
114
+ All models are built from original research and released free.
115
+
116
+ **[Support us on Ko-fi](https://ko-fi.com/dealignai)** — membership gets early access and extras.
117
+
118
+ [Ko-fi](https://ko-fi.com/dealignai) · [X @dealignai](https://x.com/dealignai) · [dealign.ai](https://dealign.ai)
119
+
120
+ See our research: [Safety Generalization in Frontier Models](https://dealign.ai/quantsteer.html)
121
+
122
+ <div align="center"><img src="dealign_logo.png" alt="dealign.ai" width="200"/></div>
123
+
124
+ ---
125
+
126
+ ## Disclaimer
127
+
128
+ This model has had its safety-refusal behavior removed for research purposes. It will follow instructions across all categories without refusing. You are solely responsible for how you use it and for complying with all applicable laws. Published for AI-safety research and authorized security testing.
chat_template.jinja ADDED
@@ -0,0 +1,363 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {%- macro format_parameters(properties, required, filter_keys=false) -%}
2
+ {%- set standard_keys = ['description', 'type', 'properties', 'required', 'nullable'] -%}
3
+ {%- set ns = namespace(found_first=false) -%}
4
+ {%- for key, value in properties | dictsort -%}
5
+ {%- set add_comma = false -%}
6
+ {%- if not filter_keys or key not in standard_keys -%}
7
+ {%- if ns.found_first %},{% endif -%}
8
+ {%- set ns.found_first = true -%}
9
+ {{ key }}:{
10
+ {%- if value['description'] -%}
11
+ description:<|"|>{{ value['description'] }}<|"|>
12
+ {%- set add_comma = true -%}
13
+ {%- endif -%}
14
+ {%- if value['type'] | upper == 'STRING' -%}
15
+ {%- if value['enum'] -%}
16
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
17
+ enum:{{ format_argument(value['enum']) }}
18
+ {%- endif -%}
19
+ {%- elif value['type'] | upper == 'ARRAY' -%}
20
+ {%- if value['items'] is mapping and value['items'] -%}
21
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
22
+ items:{
23
+ {%- set ns_items = namespace(found_first=false) -%}
24
+ {%- for item_key, item_value in value['items'] | dictsort -%}
25
+ {%- if item_value is not none -%}
26
+ {%- if ns_items.found_first %},{% endif -%}
27
+ {%- set ns_items.found_first = true -%}
28
+ {%- if item_key == 'properties' -%}
29
+ properties:{
30
+ {%- if item_value is mapping -%}
31
+ {{- format_parameters(item_value, value['items']['required'] | default([])) -}}
32
+ {%- endif -%}
33
+ }
34
+ {%- elif item_key == 'required' -%}
35
+ required:[
36
+ {%- for req_item in item_value -%}
37
+ <|"|>{{- req_item -}}<|"|>
38
+ {%- if not loop.last %},{% endif -%}
39
+ {%- endfor -%}
40
+ ]
41
+ {%- elif item_key == 'type' -%}
42
+ {%- if item_value is string -%}
43
+ type:{{ format_argument(item_value | upper) }}
44
+ {%- else -%}
45
+ type:{{ format_argument(item_value | map('upper') | list) }}
46
+ {%- endif -%}
47
+ {%- else -%}
48
+ {{ item_key }}:{{ format_argument(item_value) }}
49
+ {%- endif -%}
50
+ {%- endif -%}
51
+ {%- endfor -%}
52
+ }
53
+ {%- endif -%}
54
+ {%- endif -%}
55
+ {%- if value['nullable'] %}
56
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
57
+ nullable:true
58
+ {%- endif -%}
59
+ {%- if value['type'] | upper == 'OBJECT' -%}
60
+ {%- if value['properties'] is defined and value['properties'] is mapping -%}
61
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
62
+ properties:{
63
+ {{- format_parameters(value['properties'], value['required'] | default([])) -}}
64
+ }
65
+ {%- elif value is mapping -%}
66
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
67
+ properties:{
68
+ {{- format_parameters(value, value['required'] | default([]), filter_keys=true) -}}
69
+ }
70
+ {%- endif -%}
71
+ {%- if value['required'] -%}
72
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
73
+ required:[
74
+ {%- for item in value['required'] | default([]) -%}
75
+ <|"|>{{- item -}}<|"|>
76
+ {%- if not loop.last %},{% endif -%}
77
+ {%- endfor -%}
78
+ ]
79
+ {%- endif -%}
80
+ {%- endif -%}
81
+ {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
82
+ type:<|"|>{{ value['type'] | upper }}<|"|>}
83
+ {%- endif -%}
84
+ {%- endfor -%}
85
+ {%- endmacro -%}
86
+ {%- macro format_function_declaration(tool_data) -%}
87
+ declaration:{{- tool_data['function']['name'] -}}{description:<|"|>{{- tool_data['function']['description'] -}}<|"|>
88
+ {%- set params = tool_data['function']['parameters'] -%}
89
+ {%- if params -%}
90
+ ,parameters:{
91
+ {%- if params['properties'] -%}
92
+ properties:{ {{- format_parameters(params['properties'], params['required']) -}} },
93
+ {%- endif -%}
94
+ {%- if params['required'] -%}
95
+ required:[
96
+ {%- for item in params['required'] -%}
97
+ <|"|>{{- item -}}<|"|>
98
+ {{- ',' if not loop.last -}}
99
+ {%- endfor -%}
100
+ ],
101
+ {%- endif -%}
102
+ {%- if params['type'] -%}
103
+ type:<|"|>{{- params['type'] | upper -}}<|"|>}
104
+ {%- endif -%}
105
+ {%- endif -%}
106
+ {%- if 'response' in tool_data['function'] -%}
107
+ {%- set response_declaration = tool_data['function']['response'] -%}
108
+ ,response:{
109
+ {%- if response_declaration['description'] -%}
110
+ description:<|"|>{{- response_declaration['description'] -}}<|"|>,
111
+ {%- endif -%}
112
+ {%- if response_declaration['type'] | upper == 'OBJECT' -%}
113
+ type:<|"|>{{- response_declaration['type'] | upper -}}<|"|>}
114
+ {%- endif -%}
115
+ {%- endif -%}
116
+ }
117
+ {%- endmacro -%}
118
+ {%- macro format_argument(argument, escape_keys=True) -%}
119
+ {%- if argument is string -%}
120
+ {{- '<|"|>' + argument + '<|"|>' -}}
121
+ {%- elif argument is boolean -%}
122
+ {{- 'true' if argument else 'false' -}}
123
+ {%- elif argument is mapping -%}
124
+ {{- '{' -}}
125
+ {%- set ns = namespace(found_first=false) -%}
126
+ {%- for key, value in argument | dictsort -%}
127
+ {%- if ns.found_first %},{% endif -%}
128
+ {%- set ns.found_first = true -%}
129
+ {%- if escape_keys -%}
130
+ {{- '<|"|>' + key + '<|"|>' -}}
131
+ {%- else -%}
132
+ {{- key -}}
133
+ {%- endif -%}
134
+ :{{- format_argument(value, escape_keys=escape_keys) -}}
135
+ {%- endfor -%}
136
+ {{- '}' -}}
137
+ {%- elif argument is sequence -%}
138
+ {{- '[' -}}
139
+ {%- for item in argument -%}
140
+ {{- format_argument(item, escape_keys=escape_keys) -}}
141
+ {%- if not loop.last %},{% endif -%}
142
+ {%- endfor -%}
143
+ {{- ']' -}}
144
+ {%- else -%}
145
+ {{- argument -}}
146
+ {%- endif -%}
147
+ {%- endmacro -%}
148
+ {%- macro strip_thinking(text) -%}
149
+ {%- set ns = namespace(result='') -%}
150
+ {%- for part in text.split('<channel|>') -%}
151
+ {%- if '<|channel>' in part -%}
152
+ {%- set ns.result = ns.result + part.split('<|channel>')[0] -%}
153
+ {%- else -%}
154
+ {%- set ns.result = ns.result + part -%}
155
+ {%- endif -%}
156
+ {%- endfor -%}
157
+ {{- ns.result | trim -}}
158
+ {%- endmacro -%}
159
+
160
+ {%- macro format_tool_response_block(tool_name, response) -%}
161
+ {{- '<|tool_response>' -}}
162
+ {%- if response is mapping -%}
163
+ {{- 'response:' + tool_name + '{' -}}
164
+ {%- for key, value in response | dictsort -%}
165
+ {{- key -}}:{{- format_argument(value, escape_keys=False) -}}
166
+ {%- if not loop.last %},{% endif -%}
167
+ {%- endfor -%}
168
+ {{- '}' -}}
169
+ {%- else -%}
170
+ {{- 'response:' + tool_name + '{value:' + format_argument(response, escape_keys=False) + '}' -}}
171
+ {%- endif -%}
172
+ {{- '<tool_response|>' -}}
173
+ {%- endmacro -%}
174
+
175
+ {%- set ns = namespace(prev_message_type=None) -%}
176
+ {%- set loop_messages = messages -%}
177
+ {{- bos_token -}}
178
+ {#- Handle System/Tool Definitions Block -#}
179
+ {%- if (enable_thinking is defined and enable_thinking) or tools or messages[0]['role'] in ['system', 'developer'] -%}
180
+ {{- '<|turn>system\n' -}}
181
+ {#- Inject Thinking token at the very top of the FIRST system turn -#}
182
+ {%- if enable_thinking is defined and enable_thinking -%}
183
+ {{- '<|think|>\n' -}}
184
+ {%- set ns.prev_message_type = 'think' -%}
185
+ {%- endif -%}
186
+ {%- if messages[0]['role'] in ['system', 'developer'] -%}
187
+ {%- if messages[0]['content'] is string -%}
188
+ {{- messages[0]['content'] | trim -}}
189
+ {%- elif messages[0]['content'] is sequence -%}
190
+ {%- for item in messages[0]['content'] -%}
191
+ {{- item['text'] | trim + ' '-}}
192
+ {%- endfor -%}
193
+ {%- endif -%}
194
+ {%- set loop_messages = messages[1:] -%}
195
+ {%- endif -%}
196
+ {%- if tools -%}
197
+ {%- for tool in tools %}
198
+ {{- '<|tool>' -}}
199
+ {{- format_function_declaration(tool) | trim -}}
200
+ {{- '<tool|>' -}}
201
+ {%- endfor %}
202
+ {%- set ns.prev_message_type = 'tool' -%}
203
+ {%- if tool_choice is defined and ((tool_choice is string and tool_choice == 'required') or (tool_choice is mapping and tool_choice.get('type') == 'required')) -%}
204
+ {{- '\nTool use is REQUIRED for this turn: you must call exactly one of the declared tools. Output only the tool call, and copy every argument value verbatim \u2014 character for character, preserving punctuation and newlines, with no added or removed whitespace.' -}}
205
+ {%- endif -%}
206
+ {%- endif -%}
207
+ {{- '<turn|>\n' -}}
208
+ {%- endif %}
209
+
210
+ {#- Pre-scan: find last user message index for reasoning guard -#}
211
+ {%- set ns_turn = namespace(last_user_idx=-1) -%}
212
+ {%- for i in range(loop_messages | length) -%}
213
+ {%- if loop_messages[i]['role'] == 'user' -%}
214
+ {%- set ns_turn.last_user_idx = i -%}
215
+ {%- endif -%}
216
+ {%- endfor -%}
217
+
218
+ {#- Loop through messages -#}
219
+ {%- for message in loop_messages -%}
220
+ {%- if message['role'] != 'tool' -%}
221
+ {%- set ns.prev_message_type = None -%}
222
+ {%- set role = 'model' if message['role'] == 'assistant' else message['role'] -%}
223
+ {#- Detect continuation: suppress duplicate <|turn>model when previous non-tool message was also assistant -#}
224
+ {%- set prev_nt = namespace(role=None, found=false) -%}
225
+ {%- if loop.index0 > 0 -%}
226
+ {%- for j in range(loop.index0 - 1, -1, -1) -%}
227
+ {%- if not prev_nt.found -%}
228
+ {%- if loop_messages[j]['role'] != 'tool' -%}
229
+ {%- set prev_nt.role = loop_messages[j]['role'] -%}
230
+ {%- set prev_nt.found = true -%}
231
+ {%- endif -%}
232
+ {%- endif -%}
233
+ {%- endfor -%}
234
+ {%- endif -%}
235
+ {%- set continue_same_model_turn = (role == 'model' and prev_nt.role == 'assistant') -%}
236
+ {%- if not continue_same_model_turn -%}
237
+ {{- '<|turn>' + role + '\n' }}
238
+ {%- endif -%}
239
+
240
+ {#- Render reasoning/reasoning_content as thinking channel -#}
241
+ {%- set thinking_text = message.get('reasoning') or message.get('reasoning_content') -%}
242
+ {%- if thinking_text and loop.index0 > ns_turn.last_user_idx and message.get('tool_calls') -%}
243
+ {{- '<|channel>thought\n' + thinking_text + '\n<channel|>' -}}
244
+ {%- endif -%}
245
+
246
+ {%- if message['tool_calls'] -%}
247
+ {%- for tool_call in message['tool_calls'] -%}
248
+ {%- set function = tool_call['function'] -%}
249
+ {{- '<|tool_call>call:' + function['name'] + '{' -}}
250
+ {%- if function['arguments'] is mapping -%}
251
+ {%- set ns_args = namespace(found_first=false) -%}
252
+ {%- for key, value in function['arguments'] | dictsort -%}
253
+ {%- if ns_args.found_first %},{% endif -%}
254
+ {%- set ns_args.found_first = true -%}
255
+ {{- key -}}:{{- format_argument(value, escape_keys=False) -}}
256
+ {%- endfor -%}
257
+ {%- elif function['arguments'] is string -%}
258
+ {{- function['arguments'] -}}
259
+ {%- endif -%}
260
+ {{- '}<tool_call|>' -}}
261
+ {%- endfor -%}
262
+ {%- set ns.prev_message_type = 'tool_call' -%}
263
+ {%- endif -%}
264
+
265
+ {%- set ns_tr_out = namespace(flag=false) -%}
266
+ {%- if message.get('tool_responses') -%}
267
+ {#- Legacy: tool_responses embedded on the assistant message (Google/Gemma native) -#}
268
+ {%- for tool_response in message['tool_responses'] -%}
269
+ {{- format_tool_response_block(tool_response['name'] | default('unknown'), tool_response['response']) -}}
270
+ {%- set ns_tr_out.flag = true -%}
271
+ {%- set ns.prev_message_type = 'tool_response' -%}
272
+ {%- endfor -%}
273
+ {%- elif message.get('tool_calls') -%}
274
+ {#- OpenAI Chat Completions: forward-scan consecutive role:tool messages -#}
275
+ {%- set ns_tool_scan = namespace(stopped=false) -%}
276
+ {%- for k in range(loop.index0 + 1, loop_messages | length) -%}
277
+ {%- if ns_tool_scan.stopped -%}
278
+ {%- elif loop_messages[k]['role'] != 'tool' -%}
279
+ {%- set ns_tool_scan.stopped = true -%}
280
+ {%- else -%}
281
+ {%- set follow = loop_messages[k] -%}
282
+ {#- Resolve tool_call_id to function name -#}
283
+ {%- set ns_tname = namespace(name=follow.get('name') | default('unknown')) -%}
284
+ {%- for tc in message['tool_calls'] -%}
285
+ {%- if tc.get('id') == follow.get('tool_call_id') -%}
286
+ {%- set ns_tname.name = tc['function']['name'] -%}
287
+ {%- endif -%}
288
+ {%- endfor -%}
289
+ {#- Handle content as string or content-parts array -#}
290
+ {%- set tool_body = follow.get('content') -%}
291
+ {%- if tool_body is string -%}
292
+ {{- format_tool_response_block(ns_tname.name, tool_body) -}}
293
+ {%- elif tool_body is sequence and tool_body is not string -%}
294
+ {%- set ns_txt = namespace(s='') -%}
295
+ {%- for part in tool_body -%}
296
+ {%- if part.get('type') == 'text' -%}
297
+ {%- set ns_txt.s = ns_txt.s + (part.get('text') | default('')) -%}
298
+ {%- endif -%}
299
+ {%- endfor -%}
300
+ {{- format_tool_response_block(ns_tname.name, ns_txt.s) -}}
301
+ {%- for part in tool_body -%}
302
+ {%- if part.get('type') == 'image' -%}
303
+ {{- '<|image|>' -}}
304
+ {%- elif part.get('type') == 'audio' -%}
305
+ {{- '<|audio|>' -}}
306
+ {%- elif part.get('type') == 'video' -%}
307
+ {{- '<|video|>' -}}
308
+ {%- endif -%}
309
+ {%- endfor -%}
310
+ {%- else -%}
311
+ {{- format_tool_response_block(ns_tname.name, tool_body) -}}
312
+ {%- endif -%}
313
+ {%- set ns_tr_out.flag = true -%}
314
+ {%- set ns.prev_message_type = 'tool_response' -%}
315
+ {%- endif -%}
316
+ {%- endfor -%}
317
+ {%- endif -%}
318
+
319
+ {%- set captured_content -%}
320
+ {%- if message['content'] is string -%}
321
+ {%- if role == 'model' -%}
322
+ {{- strip_thinking(message['content']) -}}
323
+ {%- else -%}
324
+ {{- message['content'] | trim -}}
325
+ {%- endif -%}
326
+ {%- elif message['content'] is sequence -%}
327
+ {%- for item in message['content'] -%}
328
+ {%- if item['type'] == 'text' -%}
329
+ {%- if role == 'model' -%}
330
+ {{- strip_thinking(item['text']) -}}
331
+ {%- else -%}
332
+ {{- item['text'] | trim -}}
333
+ {%- endif -%}
334
+ {%- elif item['type'] == 'image' -%}
335
+ {{- '<|image|>' -}}
336
+ {%- set ns.prev_message_type = 'image' -%}
337
+ {%- elif item['type'] == 'audio' -%}
338
+ {{- '<|audio|>' -}}
339
+ {%- set ns.prev_message_type = 'audio' -%}
340
+ {%- elif item['type'] == 'video' -%}
341
+ {{- '<|video|>' -}}
342
+ {%- set ns.prev_message_type = 'video' -%}
343
+ {%- endif -%}
344
+ {%- endfor -%}
345
+ {%- endif -%}
346
+ {%- endset -%}
347
+
348
+ {{- captured_content -}}
349
+ {%- set has_content = captured_content | trim | length > 0 -%}
350
+
351
+ {%- if ns.prev_message_type == 'tool_call' and not ns_tr_out.flag -%}
352
+ {{- '<|tool_response>' -}}
353
+ {%- elif not (ns_tr_out.flag and not has_content) -%}
354
+ {{- '<turn|>\n' -}}
355
+ {%- endif -%}
356
+ {%- endif -%}
357
+ {%- endfor -%}
358
+
359
+ {%- if add_generation_prompt -%}
360
+ {%- if ns.prev_message_type != 'tool_response' and ns.prev_message_type != 'tool_call' -%}
361
+ {{- '<|turn>model\n' -}}
362
+ {%- endif -%}
363
+ {%- endif -%}
config.json ADDED
@@ -0,0 +1,909 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "Gemma4UnifiedForConditionalGeneration"
4
+ ],
5
+ "audio_config": {
6
+ "_name_or_path": "",
7
+ "architectures": null,
8
+ "audio_embed_dim": 640,
9
+ "audio_samples_per_token": 640,
10
+ "chunk_size_feed_forward": 0,
11
+ "dtype": null,
12
+ "hidden_size": 640,
13
+ "id2label": {
14
+ "0": "LABEL_0",
15
+ "1": "LABEL_1"
16
+ },
17
+ "initializer_range": 0.02,
18
+ "is_encoder_decoder": false,
19
+ "label2id": {
20
+ "LABEL_0": 0,
21
+ "LABEL_1": 1
22
+ },
23
+ "model_type": "gemma4_unified_audio",
24
+ "output_attentions": false,
25
+ "output_hidden_states": false,
26
+ "output_proj_dims": 640,
27
+ "problem_type": null,
28
+ "return_dict": true,
29
+ "rms_norm_eps": 1e-06
30
+ },
31
+ "audio_token_id": 258881,
32
+ "boa_token_id": 256000,
33
+ "boi_token_id": 255999,
34
+ "dtype": "bfloat16",
35
+ "eoa_token_index": 258883,
36
+ "eoi_token_id": 258882,
37
+ "eos_token_id": [
38
+ 1,
39
+ 106
40
+ ],
41
+ "image_token_id": 258880,
42
+ "initializer_range": 0.02,
43
+ "model_type": "gemma4_unified",
44
+ "text_config": {
45
+ "attention_bias": false,
46
+ "attention_dropout": 0.0,
47
+ "attention_k_eq_v": true,
48
+ "bos_token_id": 2,
49
+ "enable_moe_block": false,
50
+ "eos_token_id": 1,
51
+ "final_logit_softcapping": 30.0,
52
+ "global_head_dim": 512,
53
+ "head_dim": 256,
54
+ "hidden_activation": "gelu_pytorch_tanh",
55
+ "hidden_size": 3840,
56
+ "hidden_size_per_layer_input": 0,
57
+ "initializer_range": 0.02,
58
+ "intermediate_size": 15360,
59
+ "layer_types": [
60
+ "sliding_attention",
61
+ "sliding_attention",
62
+ "sliding_attention",
63
+ "sliding_attention",
64
+ "sliding_attention",
65
+ "full_attention",
66
+ "sliding_attention",
67
+ "sliding_attention",
68
+ "sliding_attention",
69
+ "sliding_attention",
70
+ "sliding_attention",
71
+ "full_attention",
72
+ "sliding_attention",
73
+ "sliding_attention",
74
+ "sliding_attention",
75
+ "sliding_attention",
76
+ "sliding_attention",
77
+ "full_attention",
78
+ "sliding_attention",
79
+ "sliding_attention",
80
+ "sliding_attention",
81
+ "sliding_attention",
82
+ "sliding_attention",
83
+ "full_attention",
84
+ "sliding_attention",
85
+ "sliding_attention",
86
+ "sliding_attention",
87
+ "sliding_attention",
88
+ "sliding_attention",
89
+ "full_attention",
90
+ "sliding_attention",
91
+ "sliding_attention",
92
+ "sliding_attention",
93
+ "sliding_attention",
94
+ "sliding_attention",
95
+ "full_attention",
96
+ "sliding_attention",
97
+ "sliding_attention",
98
+ "sliding_attention",
99
+ "sliding_attention",
100
+ "sliding_attention",
101
+ "full_attention",
102
+ "sliding_attention",
103
+ "sliding_attention",
104
+ "sliding_attention",
105
+ "sliding_attention",
106
+ "sliding_attention",
107
+ "full_attention"
108
+ ],
109
+ "max_position_embeddings": 131072,
110
+ "model_type": "gemma4_unified_text",
111
+ "moe_intermediate_size": null,
112
+ "num_attention_heads": 16,
113
+ "num_experts": null,
114
+ "num_global_key_value_heads": 1,
115
+ "num_hidden_layers": 48,
116
+ "num_key_value_heads": 8,
117
+ "num_kv_shared_layers": 0,
118
+ "pad_token_id": 0,
119
+ "rms_norm_eps": 1e-06,
120
+ "rope_parameters": {
121
+ "full_attention": {
122
+ "partial_rotary_factor": 0.25,
123
+ "rope_theta": 1000000.0,
124
+ "rope_type": "proportional"
125
+ },
126
+ "sliding_attention": {
127
+ "rope_theta": 10000.0,
128
+ "rope_type": "default"
129
+ }
130
+ },
131
+ "sliding_window": 1024,
132
+ "tie_word_embeddings": true,
133
+ "top_k_experts": null,
134
+ "use_bidirectional_attention": "vision",
135
+ "use_cache": true,
136
+ "use_double_wide_mlp": false,
137
+ "vocab_size": 262144,
138
+ "vocab_size_per_layer_input": 262144
139
+ },
140
+ "tie_word_embeddings": true,
141
+ "transformers_version": "5.10.0.dev0",
142
+ "video_token_id": 258884,
143
+ "vision_config": {
144
+ "_name_or_path": "",
145
+ "architectures": null,
146
+ "chunk_size_feed_forward": 0,
147
+ "dtype": null,
148
+ "id2label": {
149
+ "0": "LABEL_0",
150
+ "1": "LABEL_1"
151
+ },
152
+ "initializer_range": 0.02,
153
+ "is_encoder_decoder": false,
154
+ "label2id": {
155
+ "LABEL_0": 0,
156
+ "LABEL_1": 1
157
+ },
158
+ "mm_embed_dim": 3840,
159
+ "mm_posemb_size": 1120,
160
+ "model_patch_size": 48,
161
+ "model_type": "gemma4_unified_vision",
162
+ "num_soft_tokens": 280,
163
+ "output_attentions": false,
164
+ "output_hidden_states": false,
165
+ "output_proj_dims": 3840,
166
+ "patch_size": 16,
167
+ "pooling_kernel_size": 3,
168
+ "problem_type": null,
169
+ "return_dict": true,
170
+ "rms_norm_eps": 1e-06
171
+ },
172
+ "weight_format": "jang_affine",
173
+ "quantization": {
174
+ "group_size": 32,
175
+ "bits": 8,
176
+ "mode": "affine",
177
+ "tied_embedding": "fp16_passthrough",
178
+ "language_model.model.layers.0.mlp.down_proj": {
179
+ "group_size": 32,
180
+ "bits": 4,
181
+ "mode": "affine"
182
+ },
183
+ "language_model.model.layers.0.mlp.gate_proj": {
184
+ "group_size": 32,
185
+ "bits": 4,
186
+ "mode": "affine"
187
+ },
188
+ "language_model.model.layers.0.mlp.up_proj": {
189
+ "group_size": 32,
190
+ "bits": 4,
191
+ "mode": "affine"
192
+ },
193
+ "language_model.model.layers.1.mlp.down_proj": {
194
+ "group_size": 32,
195
+ "bits": 4,
196
+ "mode": "affine"
197
+ },
198
+ "language_model.model.layers.1.mlp.gate_proj": {
199
+ "group_size": 32,
200
+ "bits": 4,
201
+ "mode": "affine"
202
+ },
203
+ "language_model.model.layers.1.mlp.up_proj": {
204
+ "group_size": 32,
205
+ "bits": 4,
206
+ "mode": "affine"
207
+ },
208
+ "language_model.model.layers.10.mlp.down_proj": {
209
+ "group_size": 32,
210
+ "bits": 4,
211
+ "mode": "affine"
212
+ },
213
+ "language_model.model.layers.10.mlp.gate_proj": {
214
+ "group_size": 32,
215
+ "bits": 4,
216
+ "mode": "affine"
217
+ },
218
+ "language_model.model.layers.10.mlp.up_proj": {
219
+ "group_size": 32,
220
+ "bits": 4,
221
+ "mode": "affine"
222
+ },
223
+ "language_model.model.layers.11.mlp.down_proj": {
224
+ "group_size": 32,
225
+ "bits": 4,
226
+ "mode": "affine"
227
+ },
228
+ "language_model.model.layers.11.mlp.gate_proj": {
229
+ "group_size": 32,
230
+ "bits": 4,
231
+ "mode": "affine"
232
+ },
233
+ "language_model.model.layers.11.mlp.up_proj": {
234
+ "group_size": 32,
235
+ "bits": 4,
236
+ "mode": "affine"
237
+ },
238
+ "language_model.model.layers.12.mlp.down_proj": {
239
+ "group_size": 32,
240
+ "bits": 4,
241
+ "mode": "affine"
242
+ },
243
+ "language_model.model.layers.12.mlp.gate_proj": {
244
+ "group_size": 32,
245
+ "bits": 4,
246
+ "mode": "affine"
247
+ },
248
+ "language_model.model.layers.12.mlp.up_proj": {
249
+ "group_size": 32,
250
+ "bits": 4,
251
+ "mode": "affine"
252
+ },
253
+ "language_model.model.layers.13.mlp.down_proj": {
254
+ "group_size": 32,
255
+ "bits": 4,
256
+ "mode": "affine"
257
+ },
258
+ "language_model.model.layers.13.mlp.gate_proj": {
259
+ "group_size": 32,
260
+ "bits": 4,
261
+ "mode": "affine"
262
+ },
263
+ "language_model.model.layers.13.mlp.up_proj": {
264
+ "group_size": 32,
265
+ "bits": 4,
266
+ "mode": "affine"
267
+ },
268
+ "language_model.model.layers.14.mlp.down_proj": {
269
+ "group_size": 32,
270
+ "bits": 4,
271
+ "mode": "affine"
272
+ },
273
+ "language_model.model.layers.14.mlp.gate_proj": {
274
+ "group_size": 32,
275
+ "bits": 4,
276
+ "mode": "affine"
277
+ },
278
+ "language_model.model.layers.14.mlp.up_proj": {
279
+ "group_size": 32,
280
+ "bits": 4,
281
+ "mode": "affine"
282
+ },
283
+ "language_model.model.layers.15.mlp.down_proj": {
284
+ "group_size": 32,
285
+ "bits": 4,
286
+ "mode": "affine"
287
+ },
288
+ "language_model.model.layers.15.mlp.gate_proj": {
289
+ "group_size": 32,
290
+ "bits": 4,
291
+ "mode": "affine"
292
+ },
293
+ "language_model.model.layers.15.mlp.up_proj": {
294
+ "group_size": 32,
295
+ "bits": 4,
296
+ "mode": "affine"
297
+ },
298
+ "language_model.model.layers.16.mlp.down_proj": {
299
+ "group_size": 32,
300
+ "bits": 4,
301
+ "mode": "affine"
302
+ },
303
+ "language_model.model.layers.16.mlp.gate_proj": {
304
+ "group_size": 32,
305
+ "bits": 4,
306
+ "mode": "affine"
307
+ },
308
+ "language_model.model.layers.16.mlp.up_proj": {
309
+ "group_size": 32,
310
+ "bits": 4,
311
+ "mode": "affine"
312
+ },
313
+ "language_model.model.layers.17.mlp.down_proj": {
314
+ "group_size": 32,
315
+ "bits": 4,
316
+ "mode": "affine"
317
+ },
318
+ "language_model.model.layers.17.mlp.gate_proj": {
319
+ "group_size": 32,
320
+ "bits": 4,
321
+ "mode": "affine"
322
+ },
323
+ "language_model.model.layers.17.mlp.up_proj": {
324
+ "group_size": 32,
325
+ "bits": 4,
326
+ "mode": "affine"
327
+ },
328
+ "language_model.model.layers.18.mlp.down_proj": {
329
+ "group_size": 32,
330
+ "bits": 4,
331
+ "mode": "affine"
332
+ },
333
+ "language_model.model.layers.18.mlp.gate_proj": {
334
+ "group_size": 32,
335
+ "bits": 4,
336
+ "mode": "affine"
337
+ },
338
+ "language_model.model.layers.18.mlp.up_proj": {
339
+ "group_size": 32,
340
+ "bits": 4,
341
+ "mode": "affine"
342
+ },
343
+ "language_model.model.layers.19.mlp.down_proj": {
344
+ "group_size": 32,
345
+ "bits": 4,
346
+ "mode": "affine"
347
+ },
348
+ "language_model.model.layers.19.mlp.gate_proj": {
349
+ "group_size": 32,
350
+ "bits": 4,
351
+ "mode": "affine"
352
+ },
353
+ "language_model.model.layers.19.mlp.up_proj": {
354
+ "group_size": 32,
355
+ "bits": 4,
356
+ "mode": "affine"
357
+ },
358
+ "language_model.model.layers.2.mlp.down_proj": {
359
+ "group_size": 32,
360
+ "bits": 4,
361
+ "mode": "affine"
362
+ },
363
+ "language_model.model.layers.2.mlp.gate_proj": {
364
+ "group_size": 32,
365
+ "bits": 4,
366
+ "mode": "affine"
367
+ },
368
+ "language_model.model.layers.2.mlp.up_proj": {
369
+ "group_size": 32,
370
+ "bits": 4,
371
+ "mode": "affine"
372
+ },
373
+ "language_model.model.layers.20.mlp.down_proj": {
374
+ "group_size": 32,
375
+ "bits": 4,
376
+ "mode": "affine"
377
+ },
378
+ "language_model.model.layers.20.mlp.gate_proj": {
379
+ "group_size": 32,
380
+ "bits": 4,
381
+ "mode": "affine"
382
+ },
383
+ "language_model.model.layers.20.mlp.up_proj": {
384
+ "group_size": 32,
385
+ "bits": 4,
386
+ "mode": "affine"
387
+ },
388
+ "language_model.model.layers.21.mlp.down_proj": {
389
+ "group_size": 32,
390
+ "bits": 4,
391
+ "mode": "affine"
392
+ },
393
+ "language_model.model.layers.21.mlp.gate_proj": {
394
+ "group_size": 32,
395
+ "bits": 4,
396
+ "mode": "affine"
397
+ },
398
+ "language_model.model.layers.21.mlp.up_proj": {
399
+ "group_size": 32,
400
+ "bits": 4,
401
+ "mode": "affine"
402
+ },
403
+ "language_model.model.layers.22.mlp.down_proj": {
404
+ "group_size": 32,
405
+ "bits": 4,
406
+ "mode": "affine"
407
+ },
408
+ "language_model.model.layers.22.mlp.gate_proj": {
409
+ "group_size": 32,
410
+ "bits": 4,
411
+ "mode": "affine"
412
+ },
413
+ "language_model.model.layers.22.mlp.up_proj": {
414
+ "group_size": 32,
415
+ "bits": 4,
416
+ "mode": "affine"
417
+ },
418
+ "language_model.model.layers.23.mlp.down_proj": {
419
+ "group_size": 32,
420
+ "bits": 4,
421
+ "mode": "affine"
422
+ },
423
+ "language_model.model.layers.23.mlp.gate_proj": {
424
+ "group_size": 32,
425
+ "bits": 4,
426
+ "mode": "affine"
427
+ },
428
+ "language_model.model.layers.23.mlp.up_proj": {
429
+ "group_size": 32,
430
+ "bits": 4,
431
+ "mode": "affine"
432
+ },
433
+ "language_model.model.layers.24.mlp.down_proj": {
434
+ "group_size": 32,
435
+ "bits": 4,
436
+ "mode": "affine"
437
+ },
438
+ "language_model.model.layers.24.mlp.gate_proj": {
439
+ "group_size": 32,
440
+ "bits": 4,
441
+ "mode": "affine"
442
+ },
443
+ "language_model.model.layers.24.mlp.up_proj": {
444
+ "group_size": 32,
445
+ "bits": 4,
446
+ "mode": "affine"
447
+ },
448
+ "language_model.model.layers.25.mlp.down_proj": {
449
+ "group_size": 32,
450
+ "bits": 4,
451
+ "mode": "affine"
452
+ },
453
+ "language_model.model.layers.25.mlp.gate_proj": {
454
+ "group_size": 32,
455
+ "bits": 4,
456
+ "mode": "affine"
457
+ },
458
+ "language_model.model.layers.25.mlp.up_proj": {
459
+ "group_size": 32,
460
+ "bits": 4,
461
+ "mode": "affine"
462
+ },
463
+ "language_model.model.layers.26.mlp.down_proj": {
464
+ "group_size": 32,
465
+ "bits": 4,
466
+ "mode": "affine"
467
+ },
468
+ "language_model.model.layers.26.mlp.gate_proj": {
469
+ "group_size": 32,
470
+ "bits": 4,
471
+ "mode": "affine"
472
+ },
473
+ "language_model.model.layers.26.mlp.up_proj": {
474
+ "group_size": 32,
475
+ "bits": 4,
476
+ "mode": "affine"
477
+ },
478
+ "language_model.model.layers.27.mlp.down_proj": {
479
+ "group_size": 32,
480
+ "bits": 4,
481
+ "mode": "affine"
482
+ },
483
+ "language_model.model.layers.27.mlp.gate_proj": {
484
+ "group_size": 32,
485
+ "bits": 4,
486
+ "mode": "affine"
487
+ },
488
+ "language_model.model.layers.27.mlp.up_proj": {
489
+ "group_size": 32,
490
+ "bits": 4,
491
+ "mode": "affine"
492
+ },
493
+ "language_model.model.layers.28.mlp.down_proj": {
494
+ "group_size": 32,
495
+ "bits": 4,
496
+ "mode": "affine"
497
+ },
498
+ "language_model.model.layers.28.mlp.gate_proj": {
499
+ "group_size": 32,
500
+ "bits": 4,
501
+ "mode": "affine"
502
+ },
503
+ "language_model.model.layers.28.mlp.up_proj": {
504
+ "group_size": 32,
505
+ "bits": 4,
506
+ "mode": "affine"
507
+ },
508
+ "language_model.model.layers.29.mlp.down_proj": {
509
+ "group_size": 32,
510
+ "bits": 4,
511
+ "mode": "affine"
512
+ },
513
+ "language_model.model.layers.29.mlp.gate_proj": {
514
+ "group_size": 32,
515
+ "bits": 4,
516
+ "mode": "affine"
517
+ },
518
+ "language_model.model.layers.29.mlp.up_proj": {
519
+ "group_size": 32,
520
+ "bits": 4,
521
+ "mode": "affine"
522
+ },
523
+ "language_model.model.layers.3.mlp.down_proj": {
524
+ "group_size": 32,
525
+ "bits": 4,
526
+ "mode": "affine"
527
+ },
528
+ "language_model.model.layers.3.mlp.gate_proj": {
529
+ "group_size": 32,
530
+ "bits": 4,
531
+ "mode": "affine"
532
+ },
533
+ "language_model.model.layers.3.mlp.up_proj": {
534
+ "group_size": 32,
535
+ "bits": 4,
536
+ "mode": "affine"
537
+ },
538
+ "language_model.model.layers.30.mlp.down_proj": {
539
+ "group_size": 32,
540
+ "bits": 4,
541
+ "mode": "affine"
542
+ },
543
+ "language_model.model.layers.30.mlp.gate_proj": {
544
+ "group_size": 32,
545
+ "bits": 4,
546
+ "mode": "affine"
547
+ },
548
+ "language_model.model.layers.30.mlp.up_proj": {
549
+ "group_size": 32,
550
+ "bits": 4,
551
+ "mode": "affine"
552
+ },
553
+ "language_model.model.layers.31.mlp.down_proj": {
554
+ "group_size": 32,
555
+ "bits": 4,
556
+ "mode": "affine"
557
+ },
558
+ "language_model.model.layers.31.mlp.gate_proj": {
559
+ "group_size": 32,
560
+ "bits": 4,
561
+ "mode": "affine"
562
+ },
563
+ "language_model.model.layers.31.mlp.up_proj": {
564
+ "group_size": 32,
565
+ "bits": 4,
566
+ "mode": "affine"
567
+ },
568
+ "language_model.model.layers.32.mlp.down_proj": {
569
+ "group_size": 32,
570
+ "bits": 4,
571
+ "mode": "affine"
572
+ },
573
+ "language_model.model.layers.32.mlp.gate_proj": {
574
+ "group_size": 32,
575
+ "bits": 4,
576
+ "mode": "affine"
577
+ },
578
+ "language_model.model.layers.32.mlp.up_proj": {
579
+ "group_size": 32,
580
+ "bits": 4,
581
+ "mode": "affine"
582
+ },
583
+ "language_model.model.layers.33.mlp.down_proj": {
584
+ "group_size": 32,
585
+ "bits": 4,
586
+ "mode": "affine"
587
+ },
588
+ "language_model.model.layers.33.mlp.gate_proj": {
589
+ "group_size": 32,
590
+ "bits": 4,
591
+ "mode": "affine"
592
+ },
593
+ "language_model.model.layers.33.mlp.up_proj": {
594
+ "group_size": 32,
595
+ "bits": 4,
596
+ "mode": "affine"
597
+ },
598
+ "language_model.model.layers.34.mlp.down_proj": {
599
+ "group_size": 32,
600
+ "bits": 4,
601
+ "mode": "affine"
602
+ },
603
+ "language_model.model.layers.34.mlp.gate_proj": {
604
+ "group_size": 32,
605
+ "bits": 4,
606
+ "mode": "affine"
607
+ },
608
+ "language_model.model.layers.34.mlp.up_proj": {
609
+ "group_size": 32,
610
+ "bits": 4,
611
+ "mode": "affine"
612
+ },
613
+ "language_model.model.layers.35.mlp.down_proj": {
614
+ "group_size": 32,
615
+ "bits": 4,
616
+ "mode": "affine"
617
+ },
618
+ "language_model.model.layers.35.mlp.gate_proj": {
619
+ "group_size": 32,
620
+ "bits": 4,
621
+ "mode": "affine"
622
+ },
623
+ "language_model.model.layers.35.mlp.up_proj": {
624
+ "group_size": 32,
625
+ "bits": 4,
626
+ "mode": "affine"
627
+ },
628
+ "language_model.model.layers.36.mlp.down_proj": {
629
+ "group_size": 32,
630
+ "bits": 4,
631
+ "mode": "affine"
632
+ },
633
+ "language_model.model.layers.36.mlp.gate_proj": {
634
+ "group_size": 32,
635
+ "bits": 4,
636
+ "mode": "affine"
637
+ },
638
+ "language_model.model.layers.36.mlp.up_proj": {
639
+ "group_size": 32,
640
+ "bits": 4,
641
+ "mode": "affine"
642
+ },
643
+ "language_model.model.layers.37.mlp.down_proj": {
644
+ "group_size": 32,
645
+ "bits": 4,
646
+ "mode": "affine"
647
+ },
648
+ "language_model.model.layers.37.mlp.gate_proj": {
649
+ "group_size": 32,
650
+ "bits": 4,
651
+ "mode": "affine"
652
+ },
653
+ "language_model.model.layers.37.mlp.up_proj": {
654
+ "group_size": 32,
655
+ "bits": 4,
656
+ "mode": "affine"
657
+ },
658
+ "language_model.model.layers.38.mlp.down_proj": {
659
+ "group_size": 32,
660
+ "bits": 4,
661
+ "mode": "affine"
662
+ },
663
+ "language_model.model.layers.38.mlp.gate_proj": {
664
+ "group_size": 32,
665
+ "bits": 4,
666
+ "mode": "affine"
667
+ },
668
+ "language_model.model.layers.38.mlp.up_proj": {
669
+ "group_size": 32,
670
+ "bits": 4,
671
+ "mode": "affine"
672
+ },
673
+ "language_model.model.layers.39.mlp.down_proj": {
674
+ "group_size": 32,
675
+ "bits": 4,
676
+ "mode": "affine"
677
+ },
678
+ "language_model.model.layers.39.mlp.gate_proj": {
679
+ "group_size": 32,
680
+ "bits": 4,
681
+ "mode": "affine"
682
+ },
683
+ "language_model.model.layers.39.mlp.up_proj": {
684
+ "group_size": 32,
685
+ "bits": 4,
686
+ "mode": "affine"
687
+ },
688
+ "language_model.model.layers.4.mlp.down_proj": {
689
+ "group_size": 32,
690
+ "bits": 4,
691
+ "mode": "affine"
692
+ },
693
+ "language_model.model.layers.4.mlp.gate_proj": {
694
+ "group_size": 32,
695
+ "bits": 4,
696
+ "mode": "affine"
697
+ },
698
+ "language_model.model.layers.4.mlp.up_proj": {
699
+ "group_size": 32,
700
+ "bits": 4,
701
+ "mode": "affine"
702
+ },
703
+ "language_model.model.layers.40.mlp.down_proj": {
704
+ "group_size": 32,
705
+ "bits": 4,
706
+ "mode": "affine"
707
+ },
708
+ "language_model.model.layers.40.mlp.gate_proj": {
709
+ "group_size": 32,
710
+ "bits": 4,
711
+ "mode": "affine"
712
+ },
713
+ "language_model.model.layers.40.mlp.up_proj": {
714
+ "group_size": 32,
715
+ "bits": 4,
716
+ "mode": "affine"
717
+ },
718
+ "language_model.model.layers.41.mlp.down_proj": {
719
+ "group_size": 32,
720
+ "bits": 4,
721
+ "mode": "affine"
722
+ },
723
+ "language_model.model.layers.41.mlp.gate_proj": {
724
+ "group_size": 32,
725
+ "bits": 4,
726
+ "mode": "affine"
727
+ },
728
+ "language_model.model.layers.41.mlp.up_proj": {
729
+ "group_size": 32,
730
+ "bits": 4,
731
+ "mode": "affine"
732
+ },
733
+ "language_model.model.layers.42.mlp.down_proj": {
734
+ "group_size": 32,
735
+ "bits": 4,
736
+ "mode": "affine"
737
+ },
738
+ "language_model.model.layers.42.mlp.gate_proj": {
739
+ "group_size": 32,
740
+ "bits": 4,
741
+ "mode": "affine"
742
+ },
743
+ "language_model.model.layers.42.mlp.up_proj": {
744
+ "group_size": 32,
745
+ "bits": 4,
746
+ "mode": "affine"
747
+ },
748
+ "language_model.model.layers.43.mlp.down_proj": {
749
+ "group_size": 32,
750
+ "bits": 4,
751
+ "mode": "affine"
752
+ },
753
+ "language_model.model.layers.43.mlp.gate_proj": {
754
+ "group_size": 32,
755
+ "bits": 4,
756
+ "mode": "affine"
757
+ },
758
+ "language_model.model.layers.43.mlp.up_proj": {
759
+ "group_size": 32,
760
+ "bits": 4,
761
+ "mode": "affine"
762
+ },
763
+ "language_model.model.layers.44.mlp.down_proj": {
764
+ "group_size": 32,
765
+ "bits": 4,
766
+ "mode": "affine"
767
+ },
768
+ "language_model.model.layers.44.mlp.gate_proj": {
769
+ "group_size": 32,
770
+ "bits": 4,
771
+ "mode": "affine"
772
+ },
773
+ "language_model.model.layers.44.mlp.up_proj": {
774
+ "group_size": 32,
775
+ "bits": 4,
776
+ "mode": "affine"
777
+ },
778
+ "language_model.model.layers.45.mlp.down_proj": {
779
+ "group_size": 32,
780
+ "bits": 4,
781
+ "mode": "affine"
782
+ },
783
+ "language_model.model.layers.45.mlp.gate_proj": {
784
+ "group_size": 32,
785
+ "bits": 4,
786
+ "mode": "affine"
787
+ },
788
+ "language_model.model.layers.45.mlp.up_proj": {
789
+ "group_size": 32,
790
+ "bits": 4,
791
+ "mode": "affine"
792
+ },
793
+ "language_model.model.layers.46.mlp.down_proj": {
794
+ "group_size": 32,
795
+ "bits": 4,
796
+ "mode": "affine"
797
+ },
798
+ "language_model.model.layers.46.mlp.gate_proj": {
799
+ "group_size": 32,
800
+ "bits": 4,
801
+ "mode": "affine"
802
+ },
803
+ "language_model.model.layers.46.mlp.up_proj": {
804
+ "group_size": 32,
805
+ "bits": 4,
806
+ "mode": "affine"
807
+ },
808
+ "language_model.model.layers.47.mlp.down_proj": {
809
+ "group_size": 32,
810
+ "bits": 4,
811
+ "mode": "affine"
812
+ },
813
+ "language_model.model.layers.47.mlp.gate_proj": {
814
+ "group_size": 32,
815
+ "bits": 4,
816
+ "mode": "affine"
817
+ },
818
+ "language_model.model.layers.47.mlp.up_proj": {
819
+ "group_size": 32,
820
+ "bits": 4,
821
+ "mode": "affine"
822
+ },
823
+ "language_model.model.layers.5.mlp.down_proj": {
824
+ "group_size": 32,
825
+ "bits": 4,
826
+ "mode": "affine"
827
+ },
828
+ "language_model.model.layers.5.mlp.gate_proj": {
829
+ "group_size": 32,
830
+ "bits": 4,
831
+ "mode": "affine"
832
+ },
833
+ "language_model.model.layers.5.mlp.up_proj": {
834
+ "group_size": 32,
835
+ "bits": 4,
836
+ "mode": "affine"
837
+ },
838
+ "language_model.model.layers.6.mlp.down_proj": {
839
+ "group_size": 32,
840
+ "bits": 4,
841
+ "mode": "affine"
842
+ },
843
+ "language_model.model.layers.6.mlp.gate_proj": {
844
+ "group_size": 32,
845
+ "bits": 4,
846
+ "mode": "affine"
847
+ },
848
+ "language_model.model.layers.6.mlp.up_proj": {
849
+ "group_size": 32,
850
+ "bits": 4,
851
+ "mode": "affine"
852
+ },
853
+ "language_model.model.layers.7.mlp.down_proj": {
854
+ "group_size": 32,
855
+ "bits": 4,
856
+ "mode": "affine"
857
+ },
858
+ "language_model.model.layers.7.mlp.gate_proj": {
859
+ "group_size": 32,
860
+ "bits": 4,
861
+ "mode": "affine"
862
+ },
863
+ "language_model.model.layers.7.mlp.up_proj": {
864
+ "group_size": 32,
865
+ "bits": 4,
866
+ "mode": "affine"
867
+ },
868
+ "language_model.model.layers.8.mlp.down_proj": {
869
+ "group_size": 32,
870
+ "bits": 4,
871
+ "mode": "affine"
872
+ },
873
+ "language_model.model.layers.8.mlp.gate_proj": {
874
+ "group_size": 32,
875
+ "bits": 4,
876
+ "mode": "affine"
877
+ },
878
+ "language_model.model.layers.8.mlp.up_proj": {
879
+ "group_size": 32,
880
+ "bits": 4,
881
+ "mode": "affine"
882
+ },
883
+ "language_model.model.layers.9.mlp.down_proj": {
884
+ "group_size": 32,
885
+ "bits": 4,
886
+ "mode": "affine"
887
+ },
888
+ "language_model.model.layers.9.mlp.gate_proj": {
889
+ "group_size": 32,
890
+ "bits": 4,
891
+ "mode": "affine"
892
+ },
893
+ "language_model.model.layers.9.mlp.up_proj": {
894
+ "group_size": 32,
895
+ "bits": 4,
896
+ "mode": "affine"
897
+ }
898
+ },
899
+ "capabilities": {
900
+ "reasoning_parser": "gemma4",
901
+ "tool_parser": "gemma4",
902
+ "think_in_template": false,
903
+ "supports_tools": true,
904
+ "supports_thinking": true,
905
+ "family": "gemma4",
906
+ "modality": "vision",
907
+ "cache_type": "kv"
908
+ }
909
+ }
dealign_logo.png ADDED
dealign_mascot.png ADDED
generation_config.json ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 2,
3
+ "do_sample": true,
4
+ "eos_token_id": [
5
+ 1,
6
+ 106,
7
+ 50
8
+ ],
9
+ "pad_token_id": 0,
10
+ "suppress_tokens": [
11
+ 258883,
12
+ 258882
13
+ ],
14
+ "temperature": 1.0,
15
+ "top_k": 64,
16
+ "top_p": 0.95,
17
+ "transformers_version": "5.10.0.dev0"
18
+ }
jang_config.json ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": 2,
3
+ "weight_format": "jang_affine",
4
+ "profile": "JANG_4M",
5
+ "source_model": {
6
+ "name": "gemma-4-12B-it",
7
+ "architecture": "gemma4_unified_text"
8
+ },
9
+ "has_vision": true,
10
+ "has_audio": true,
11
+ "quantization": {
12
+ "method": "jang_affine",
13
+ "quantization_backend": "mx.quantize",
14
+ "mode": "affine",
15
+ "group_size": 32,
16
+ "tier_bits": {
17
+ "attention": 8,
18
+ "mlp": 4,
19
+ "embed": 16
20
+ },
21
+ "tied_embedding": "fp16_passthrough",
22
+ "norm_convention": "gemma4_scale_shift_zero",
23
+ "multimodal": "fp16_passthrough_embedders_early_fusion",
24
+ "mtp_policy": "none",
25
+ "per_module_override_count": 144,
26
+ "passthrough_tensor_count": 349
27
+ },
28
+ "runtime": {
29
+ "total_weight_bytes": 10135442880,
30
+ "total_weight_gb": 9.44,
31
+ "attention": "hybrid_swa_full_5to1",
32
+ "sliding_window": 1024,
33
+ "attention_k_eq_v_on_full_layers": true,
34
+ "full_attention_layers": [
35
+ 5,
36
+ 11,
37
+ 17,
38
+ 23,
39
+ 29,
40
+ 35,
41
+ 41,
42
+ 47
43
+ ]
44
+ },
45
+ "capabilities": {
46
+ "reasoning_parser": "gemma4",
47
+ "tool_parser": "gemma4",
48
+ "think_in_template": false,
49
+ "supports_tools": true,
50
+ "supports_thinking": true,
51
+ "family": "gemma4",
52
+ "modality": "vision",
53
+ "cache_type": "kv"
54
+ }
55
+ }
model-00001-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:951a2a2c026b9810575744a815cdf33dde57a1547ecff8e46d467745fa286555
3
+ size 2047672664
model-00002-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:437a4827c3de3f50ba5017985d48b629ac61e65871ebe933c702347facc9e1fd
3
+ size 1001192692
model-00003-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:67277789451bf8b1699707a3073d62e8c63977797eabd2c9cedc6c1d946ff94d
3
+ size 1002176044
model-00004-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d727ecffda370546fece49d0b6a327e66f941423014c16dc6a289d3e8d5f2d80
3
+ size 1010040320
model-00005-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8158819add511610ccfc4631c625f80f63fa9445bb0b3193ebb9182c6341b681
3
+ size 1013972490
model-00006-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:78912bfd14bfd2eb3dbc0f5717222aa63fe7ec51ff7872aea57c1e49c88f276e
3
+ size 1014955699
model-00007-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:28700ff2796fa827ac8bbf2ec51cce76d98c9ee60c4179353ae6b690e652c04a
3
+ size 1013734310
model-00008-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1bbbe31ed9fd14f27717a8214598e2baa235e456fe3228bcc2fee7aa5bf2674
3
+ size 1002176052
model-00009-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7b3eb166f2824e2fba6b8f830078b9103526d230e854f01c3ee8e62f406db43c
3
+ size 1045683690
model-00010-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cda6679df16fa313565c96cd1528a56a989a292a4436b141d464e3a518a15e99
3
+ size 17262256
model.safetensors.index.json ADDED
The diff for this file is too large to render. See raw diff
 
processor_config.json ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "audio_ms_per_token": 40,
3
+ "audio_seq_length": 750,
4
+ "feature_extractor": {
5
+ "audio_samples_per_token": 640,
6
+ "feature_extractor_type": "Gemma4UnifiedAudioFeatureExtractor",
7
+ "feature_size": 640,
8
+ "padding_side": "right",
9
+ "padding_value": 0.0,
10
+ "return_attention_mask": true,
11
+ "sampling_rate": 16000
12
+ },
13
+ "image_processor": {
14
+ "do_convert_rgb": true,
15
+ "do_normalize": false,
16
+ "do_rescale": true,
17
+ "do_resize": true,
18
+ "image_mean": [
19
+ 0.0,
20
+ 0.0,
21
+ 0.0
22
+ ],
23
+ "image_processor_type": "Gemma4UnifiedImageProcessor",
24
+ "image_std": [
25
+ 1.0,
26
+ 1.0,
27
+ 1.0
28
+ ],
29
+ "max_soft_tokens": 280,
30
+ "patch_size": 16,
31
+ "pooling_kernel_size": 3,
32
+ "resample": 3,
33
+ "rescale_factor": 0.00392156862745098
34
+ },
35
+ "image_seq_length": 280,
36
+ "processor_class": "Gemma4UnifiedProcessor",
37
+ "video_processor": {
38
+ "do_convert_rgb": true,
39
+ "do_normalize": true,
40
+ "do_rescale": true,
41
+ "do_resize": true,
42
+ "do_sample_frames": true,
43
+ "image_mean": [
44
+ 0.0,
45
+ 0.0,
46
+ 0.0
47
+ ],
48
+ "image_std": [
49
+ 1.0,
50
+ 1.0,
51
+ 1.0
52
+ ],
53
+ "max_soft_tokens": 70,
54
+ "num_frames": 32,
55
+ "patch_size": 16,
56
+ "pooling_kernel_size": 3,
57
+ "resample": 3,
58
+ "rescale_factor": 0.00392156862745098,
59
+ "return_metadata": false,
60
+ "video_processor_type": "Gemma4UnifiedVideoProcessor"
61
+ }
62
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc8d3a0ce36466ccc1278bf987df5f71db1719b9ca6b4118264f45cb627bfe0f
3
+ size 32169626
tokenizer_config.json ADDED
@@ -0,0 +1,75 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "audio_token": "<|audio|>",
3
+ "backend": "tokenizers",
4
+ "boa_token": "<|audio>",
5
+ "boi_token": "<|image>",
6
+ "bos_token": "<bos>",
7
+ "eoa_token": "<audio|>",
8
+ "eoc_token": "<channel|>",
9
+ "eoi_token": "<image|>",
10
+ "eos_token": "<eos>",
11
+ "eot_token": "<turn|>",
12
+ "escape_token": "<|\"|>",
13
+ "etc_token": "<tool_call|>",
14
+ "etd_token": "<tool|>",
15
+ "etr_token": "<tool_response|>",
16
+ "extra_special_tokens": [
17
+ "<|video|>"
18
+ ],
19
+ "image_token": "<|image|>",
20
+ "mask_token": "<mask>",
21
+ "model_max_length": 1000000000000000019884624838656,
22
+ "pad_token": "<pad>",
23
+ "padding_side": "left",
24
+ "processor_class": "Gemma4UnifiedProcessor",
25
+ "response_schema": {
26
+ "properties": {
27
+ "content": {
28
+ "type": "string"
29
+ },
30
+ "role": {
31
+ "const": "assistant"
32
+ },
33
+ "thinking": {
34
+ "type": "string"
35
+ },
36
+ "tool_calls": {
37
+ "items": {
38
+ "properties": {
39
+ "function": {
40
+ "properties": {
41
+ "arguments": {
42
+ "additionalProperties": {},
43
+ "type": "object",
44
+ "x-parser": "gemma4-tool-call"
45
+ },
46
+ "name": {
47
+ "type": "string"
48
+ }
49
+ },
50
+ "type": "object",
51
+ "x-regex": "call\\:(?P<name>\\w+)(?P<arguments>\\{.*\\})"
52
+ },
53
+ "type": {
54
+ "const": "function"
55
+ }
56
+ },
57
+ "type": "object"
58
+ },
59
+ "type": "array",
60
+ "x-regex-iterator": "<\\|tool_call>(.*?)<tool_call\\|>"
61
+ }
62
+ },
63
+ "type": "object",
64
+ "x-regex": "(\\<\\|channel\\>thought\\n(?P<thinking>.*?)\\<channel\\|\\>)?(?P<tool_calls>\\<\\|tool_call\\>.*\\<tool_call\\|\\>)?(?P<content>(?:(?!\\<turn\\|\\>)(?!\\<\\|tool_response\\>).)+)?(?:\\<turn\\|\\>|\\<\\|tool_response\\>)?"
65
+ },
66
+ "soc_token": "<|channel>",
67
+ "sot_token": "<|turn>",
68
+ "stc_token": "<|tool_call>",
69
+ "std_token": "<|tool>",
70
+ "str_token": "<|tool_response>",
71
+ "think_token": "<|think|>",
72
+ "tokenizer_class": "GemmaTokenizer",
73
+ "unk_token": "<unk>",
74
+ "chat_template": "{%- macro format_parameters(properties, required, filter_keys=false) -%}\n {%- set standard_keys = ['description', 'type', 'properties', 'required', 'nullable'] -%}\n {%- set ns = namespace(found_first=false) -%}\n {%- for key, value in properties | dictsort -%}\n {%- set add_comma = false -%}\n {%- if not filter_keys or key not in standard_keys -%}\n {%- if ns.found_first %},{% endif -%}\n {%- set ns.found_first = true -%}\n {{ key }}:{\n {%- if value['description'] -%}\n description:<|\"|>{{ value['description'] }}<|\"|>\n {%- set add_comma = true -%}\n {%- endif -%}\n {%- if value['type'] | upper == 'STRING' -%}\n {%- if value['enum'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n enum:{{ format_argument(value['enum']) }}\n {%- endif -%}\n {%- elif value['type'] | upper == 'ARRAY' -%}\n {%- if value['items'] is mapping and value['items'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n items:{\n {%- set ns_items = namespace(found_first=false) -%}\n {%- for item_key, item_value in value['items'] | dictsort -%}\n {%- if item_value is not none -%}\n {%- if ns_items.found_first %},{% endif -%}\n {%- set ns_items.found_first = true -%}\n {%- if item_key == 'properties' -%}\n properties:{\n {%- if item_value is mapping -%}\n {{- format_parameters(item_value, value['items']['required'] | default([])) -}}\n {%- endif -%}\n }\n {%- elif item_key == 'required' -%}\n required:[\n {%- for req_item in item_value -%}\n <|\"|>{{- req_item -}}<|\"|>\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n ]\n {%- elif item_key == 'type' -%}\n {%- if item_value is string -%}\n type:{{ format_argument(item_value | upper) }}\n {%- else -%}\n type:{{ format_argument(item_value | map('upper') | list) }}\n {%- endif -%}\n {%- else -%}\n {{ item_key }}:{{ format_argument(item_value) }}\n {%- endif -%}\n {%- endif -%}\n {%- endfor -%}\n }\n {%- endif -%}\n {%- endif -%}\n {%- if value['nullable'] %}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n nullable:true\n {%- endif -%}\n {%- if value['type'] | upper == 'OBJECT' -%}\n {%- if value['properties'] is defined and value['properties'] is mapping -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n properties:{\n {{- format_parameters(value['properties'], value['required'] | default([])) -}}\n }\n {%- elif value is mapping -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n properties:{\n {{- format_parameters(value, value['required'] | default([]), filter_keys=true) -}}\n }\n {%- endif -%}\n {%- if value['required'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n required:[\n {%- for item in value['required'] | default([]) -%}\n <|\"|>{{- item -}}<|\"|>\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n ]\n {%- endif -%}\n {%- endif -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n type:<|\"|>{{ value['type'] | upper }}<|\"|>}\n {%- endif -%}\n {%- endfor -%}\n{%- endmacro -%}\n{%- macro format_function_declaration(tool_data) -%}\n declaration:{{- tool_data['function']['name'] -}}{description:<|\"|>{{- tool_data['function']['description'] -}}<|\"|>\n {%- set params = tool_data['function']['parameters'] -%}\n {%- if params -%}\n ,parameters:{\n {%- if params['properties'] -%}\n properties:{ {{- format_parameters(params['properties'], params['required']) -}} },\n {%- endif -%}\n {%- if params['required'] -%}\n required:[\n {%- for item in params['required'] -%}\n <|\"|>{{- item -}}<|\"|>\n {{- ',' if not loop.last -}}\n {%- endfor -%}\n ],\n {%- endif -%}\n {%- if params['type'] -%}\n type:<|\"|>{{- params['type'] | upper -}}<|\"|>}\n {%- endif -%}\n {%- endif -%}\n {%- if 'response' in tool_data['function'] -%}\n {%- set response_declaration = tool_data['function']['response'] -%}\n ,response:{\n {%- if response_declaration['description'] -%}\n description:<|\"|>{{- response_declaration['description'] -}}<|\"|>,\n {%- endif -%}\n {%- if response_declaration['type'] | upper == 'OBJECT' -%}\n type:<|\"|>{{- response_declaration['type'] | upper -}}<|\"|>}\n {%- endif -%}\n {%- endif -%}\n }\n{%- endmacro -%}\n{%- macro format_argument(argument, escape_keys=True) -%}\n {%- if argument is string -%}\n {{- '<|\"|>' + argument + '<|\"|>' -}}\n {%- elif argument is boolean -%}\n {{- 'true' if argument else 'false' -}}\n {%- elif argument is mapping -%}\n {{- '{' -}}\n {%- set ns = namespace(found_first=false) -%}\n {%- for key, value in argument | dictsort -%}\n {%- if ns.found_first %},{% endif -%}\n {%- set ns.found_first = true -%}\n {%- if escape_keys -%}\n {{- '<|\"|>' + key + '<|\"|>' -}}\n {%- else -%}\n {{- key -}}\n {%- endif -%}\n :{{- format_argument(value, escape_keys=escape_keys) -}}\n {%- endfor -%}\n {{- '}' -}}\n {%- elif argument is sequence -%}\n {{- '[' -}}\n {%- for item in argument -%}\n {{- format_argument(item, escape_keys=escape_keys) -}}\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n {{- ']' -}}\n {%- else -%}\n {{- argument -}}\n {%- endif -%}\n{%- endmacro -%}\n{%- macro strip_thinking(text) -%}\n {%- set ns = namespace(result='') -%}\n {%- for part in text.split('<channel|>') -%}\n {%- if '<|channel>' in part -%}\n {%- set ns.result = ns.result + part.split('<|channel>')[0] -%}\n {%- else -%}\n {%- set ns.result = ns.result + part -%}\n {%- endif -%}\n {%- endfor -%}\n {{- ns.result | trim -}}\n{%- endmacro -%}\n\n{%- macro format_tool_response_block(tool_name, response) -%}\n {{- '<|tool_response>' -}}\n {%- if response is mapping -%}\n {{- 'response:' + tool_name + '{' -}}\n {%- for key, value in response | dictsort -%}\n {{- key -}}:{{- format_argument(value, escape_keys=False) -}}\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n {{- '}' -}}\n {%- else -%}\n {{- 'response:' + tool_name + '{value:' + format_argument(response, escape_keys=False) + '}' -}}\n {%- endif -%}\n {{- '<tool_response|>' -}}\n{%- endmacro -%}\n\n{%- set ns = namespace(prev_message_type=None) -%}\n{%- set loop_messages = messages -%}\n{{- bos_token -}}\n{#- Handle System/Tool Definitions Block -#}\n{%- if (enable_thinking is defined and enable_thinking) or tools or messages[0]['role'] in ['system', 'developer'] -%}\n {{- '<|turn>system\\n' -}}\n {#- Inject Thinking token at the very top of the FIRST system turn -#}\n {%- if enable_thinking is defined and enable_thinking -%}\n {{- '<|think|>\\n' -}}\n {%- set ns.prev_message_type = 'think' -%}\n {%- endif -%}\n {%- if messages[0]['role'] in ['system', 'developer'] -%}\n {%- if messages[0]['content'] is string -%}\n {{- messages[0]['content'] | trim -}}\n {%- elif messages[0]['content'] is sequence -%}\n {%- for item in messages[0]['content'] -%}\n {{- item['text'] | trim + ' '-}}\n {%- endfor -%}\n {%- endif -%}\n {%- set loop_messages = messages[1:] -%}\n {%- endif -%}\n {%- if tools -%}\n {%- for tool in tools %}\n {{- '<|tool>' -}}\n {{- format_function_declaration(tool) | trim -}}\n {{- '<tool|>' -}}\n {%- endfor %}\n {%- set ns.prev_message_type = 'tool' -%}\n {%- if tool_choice is defined and ((tool_choice is string and tool_choice == 'required') or (tool_choice is mapping and tool_choice.get('type') == 'required')) -%}\n {{- '\\nTool use is REQUIRED for this turn: you must call exactly one of the declared tools. Output only the tool call, and copy every argument value verbatim \\u2014 character for character, preserving punctuation and newlines, with no added or removed whitespace.' -}}\n {%- endif -%}\n {%- endif -%}\n {{- '<turn|>\\n' -}}\n{%- endif %}\n\n{#- Pre-scan: find last user message index for reasoning guard -#}\n{%- set ns_turn = namespace(last_user_idx=-1) -%}\n{%- for i in range(loop_messages | length) -%}\n {%- if loop_messages[i]['role'] == 'user' -%}\n {%- set ns_turn.last_user_idx = i -%}\n {%- endif -%}\n{%- endfor -%}\n\n{#- Loop through messages -#}\n{%- for message in loop_messages -%}\n {%- if message['role'] != 'tool' -%}\n {%- set ns.prev_message_type = None -%}\n {%- set role = 'model' if message['role'] == 'assistant' else message['role'] -%}\n {#- Detect continuation: suppress duplicate <|turn>model when previous non-tool message was also assistant -#}\n {%- set prev_nt = namespace(role=None, found=false) -%}\n {%- if loop.index0 > 0 -%}\n {%- for j in range(loop.index0 - 1, -1, -1) -%}\n {%- if not prev_nt.found -%}\n {%- if loop_messages[j]['role'] != 'tool' -%}\n {%- set prev_nt.role = loop_messages[j]['role'] -%}\n {%- set prev_nt.found = true -%}\n {%- endif -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n {%- set continue_same_model_turn = (role == 'model' and prev_nt.role == 'assistant') -%}\n {%- if not continue_same_model_turn -%}\n {{- '<|turn>' + role + '\\n' }}\n {%- endif -%}\n\n {#- Render reasoning/reasoning_content as thinking channel -#}\n {%- set thinking_text = message.get('reasoning') or message.get('reasoning_content') -%}\n {%- if thinking_text and loop.index0 > ns_turn.last_user_idx and message.get('tool_calls') -%}\n {{- '<|channel>thought\\n' + thinking_text + '\\n<channel|>' -}}\n {%- endif -%}\n\n {%- if message['tool_calls'] -%}\n {%- for tool_call in message['tool_calls'] -%}\n {%- set function = tool_call['function'] -%}\n {{- '<|tool_call>call:' + function['name'] + '{' -}}\n {%- if function['arguments'] is mapping -%}\n {%- set ns_args = namespace(found_first=false) -%}\n {%- for key, value in function['arguments'] | dictsort -%}\n {%- if ns_args.found_first %},{% endif -%}\n {%- set ns_args.found_first = true -%}\n {{- key -}}:{{- format_argument(value, escape_keys=False) -}}\n {%- endfor -%}\n {%- elif function['arguments'] is string -%}\n {{- function['arguments'] -}}\n {%- endif -%}\n {{- '}<tool_call|>' -}}\n {%- endfor -%}\n {%- set ns.prev_message_type = 'tool_call' -%}\n {%- endif -%}\n\n {%- set ns_tr_out = namespace(flag=false) -%}\n {%- if message.get('tool_responses') -%}\n {#- Legacy: tool_responses embedded on the assistant message (Google/Gemma native) -#}\n {%- for tool_response in message['tool_responses'] -%}\n {{- format_tool_response_block(tool_response['name'] | default('unknown'), tool_response['response']) -}}\n {%- set ns_tr_out.flag = true -%}\n {%- set ns.prev_message_type = 'tool_response' -%}\n {%- endfor -%}\n {%- elif message.get('tool_calls') -%}\n {#- OpenAI Chat Completions: forward-scan consecutive role:tool messages -#}\n {%- set ns_tool_scan = namespace(stopped=false) -%}\n {%- for k in range(loop.index0 + 1, loop_messages | length) -%}\n {%- if ns_tool_scan.stopped -%}\n {%- elif loop_messages[k]['role'] != 'tool' -%}\n {%- set ns_tool_scan.stopped = true -%}\n {%- else -%}\n {%- set follow = loop_messages[k] -%}\n {#- Resolve tool_call_id to function name -#}\n {%- set ns_tname = namespace(name=follow.get('name') | default('unknown')) -%}\n {%- for tc in message['tool_calls'] -%}\n {%- if tc.get('id') == follow.get('tool_call_id') -%}\n {%- set ns_tname.name = tc['function']['name'] -%}\n {%- endif -%}\n {%- endfor -%}\n {#- Handle content as string or content-parts array -#}\n {%- set tool_body = follow.get('content') -%}\n {%- if tool_body is string -%}\n {{- format_tool_response_block(ns_tname.name, tool_body) -}}\n {%- elif tool_body is sequence and tool_body is not string -%}\n {%- set ns_txt = namespace(s='') -%}\n {%- for part in tool_body -%}\n {%- if part.get('type') == 'text' -%}\n {%- set ns_txt.s = ns_txt.s + (part.get('text') | default('')) -%}\n {%- endif -%}\n {%- endfor -%}\n {{- format_tool_response_block(ns_tname.name, ns_txt.s) -}}\n {%- for part in tool_body -%}\n {%- if part.get('type') == 'image' -%}\n {{- '<|image|>' -}}\n {%- elif part.get('type') == 'audio' -%}\n {{- '<|audio|>' -}}\n {%- elif part.get('type') == 'video' -%}\n {{- '<|video|>' -}}\n {%- endif -%}\n {%- endfor -%}\n {%- else -%}\n {{- format_tool_response_block(ns_tname.name, tool_body) -}}\n {%- endif -%}\n {%- set ns_tr_out.flag = true -%}\n {%- set ns.prev_message_type = 'tool_response' -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n\n {%- set captured_content -%}\n {%- if message['content'] is string -%}\n {%- if role == 'model' -%}\n {{- strip_thinking(message['content']) -}}\n {%- else -%}\n {{- message['content'] | trim -}}\n {%- endif -%}\n {%- elif message['content'] is sequence -%}\n {%- for item in message['content'] -%}\n {%- if item['type'] == 'text' -%}\n {%- if role == 'model' -%}\n {{- strip_thinking(item['text']) -}}\n {%- else -%}\n {{- item['text'] | trim -}}\n {%- endif -%}\n {%- elif item['type'] == 'image' -%}\n {{- '<|image|>' -}}\n {%- set ns.prev_message_type = 'image' -%}\n {%- elif item['type'] == 'audio' -%}\n {{- '<|audio|>' -}}\n {%- set ns.prev_message_type = 'audio' -%}\n {%- elif item['type'] == 'video' -%}\n {{- '<|video|>' -}}\n {%- set ns.prev_message_type = 'video' -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n {%- endset -%}\n\n {{- captured_content -}}\n {%- set has_content = captured_content | trim | length > 0 -%}\n\n {%- if ns.prev_message_type == 'tool_call' and not ns_tr_out.flag -%}\n {{- '<|tool_response>' -}}\n {%- elif not (ns_tr_out.flag and not has_content) -%}\n {{- '<turn|>\\n' -}}\n {%- endif -%}\n {%- endif -%}\n{%- endfor -%}\n\n{%- if add_generation_prompt -%}\n {%- if ns.prev_message_type != 'tool_response' and ns.prev_message_type != 'tool_call' -%}\n {{- '<|turn>model\\n' -}}\n {%- endif -%}\n{%- endif -%}"
75
+ }
vmlx-banner.png ADDED