Image-Text-to-Text
MLX
Safetensors
gemma4_unified
apple-silicon
abliterated
uncensored
crack
vision
audio
omni-modal
gemma4
gemma
google
harmbench
mmlu
jang_4m
conversational
Instructions to use dealignai/Gemma-4-12B-it-JANG_4M-CRACK with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use dealignai/Gemma-4-12B-it-JANG_4M-CRACK with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("dealignai/Gemma-4-12B-it-JANG_4M-CRACK") config = load_config("dealignai/Gemma-4-12B-it-JANG_4M-CRACK") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Pi
How to use dealignai/Gemma-4-12B-it-JANG_4M-CRACK with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "dealignai/Gemma-4-12B-it-JANG_4M-CRACK"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "dealignai/Gemma-4-12B-it-JANG_4M-CRACK" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use dealignai/Gemma-4-12B-it-JANG_4M-CRACK with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "dealignai/Gemma-4-12B-it-JANG_4M-CRACK"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default dealignai/Gemma-4-12B-it-JANG_4M-CRACK
Run Hermes
hermes
Upload folder using huggingface_hub
Browse files- .gitattributes +1 -0
- README.md +128 -0
- chat_template.jinja +363 -0
- config.json +909 -0
- dealign_logo.png +0 -0
- dealign_mascot.png +0 -0
- generation_config.json +18 -0
- jang_config.json +55 -0
- model-00001-of-00010.safetensors +3 -0
- model-00002-of-00010.safetensors +3 -0
- model-00003-of-00010.safetensors +3 -0
- model-00004-of-00010.safetensors +3 -0
- model-00005-of-00010.safetensors +3 -0
- model-00006-of-00010.safetensors +3 -0
- model-00007-of-00010.safetensors +3 -0
- model-00008-of-00010.safetensors +3 -0
- model-00009-of-00010.safetensors +3 -0
- model-00010-of-00010.safetensors +3 -0
- model.safetensors.index.json +0 -0
- processor_config.json +62 -0
- tokenizer.json +3 -0
- tokenizer_config.json +75 -0
- vmlx-banner.png +0 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,128 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: gemma
|
| 3 |
+
base_model: google/gemma-4-12B-it
|
| 4 |
+
library_name: mlx
|
| 5 |
+
pipeline_tag: image-text-to-text
|
| 6 |
+
thumbnail: dealign_mascot.png
|
| 7 |
+
tags:
|
| 8 |
+
- mlx
|
| 9 |
+
- apple-silicon
|
| 10 |
+
- abliterated
|
| 11 |
+
- uncensored
|
| 12 |
+
- crack
|
| 13 |
+
- vision
|
| 14 |
+
- audio
|
| 15 |
+
- omni-modal
|
| 16 |
+
- gemma4
|
| 17 |
+
- gemma
|
| 18 |
+
- google
|
| 19 |
+
- harmbench
|
| 20 |
+
- mmlu
|
| 21 |
+
- jang_4m
|
| 22 |
+
---
|
| 23 |
+
|
| 24 |
+
<!-- vmlx-banner -->
|
| 25 |
+
<div align="center">
|
| 26 |
+
<a href="https://vmlx.net">
|
| 27 |
+
<img src="vmlx-banner.png" width="280" />
|
| 28 |
+
<br/>
|
| 29 |
+
<strong>Built for vMLX</strong> — the MLX inferencer with KV-cache quantization, prefix-cache reuse, agentic tool calling, and hybrid sliding+full attention support.
|
| 30 |
+
<br/>
|
| 31 |
+
<sub>Free for macOS · <strong>vmlx.net</strong></sub>
|
| 32 |
+
</a>
|
| 33 |
+
</div>
|
| 34 |
+
|
| 35 |
+
---
|
| 36 |
+
|
| 37 |
+
<div align="center">
|
| 38 |
+
|
| 39 |
+
<img src="dealign_mascot.png" width="128" />
|
| 40 |
+
|
| 41 |
+
# Gemma 4 12B-it — JANG_4M CRACK
|
| 42 |
+
|
| 43 |
+
**CRACK abliterated** · **JANG mixed-precision (8-bit attention, 4-bit MLP)** · Omni-modal (text + image + audio + video) · 9.6 GB
|
| 44 |
+
|
| 45 |
+
<a href="https://ko-fi.com/dealignai"><img src="https://img.shields.io/badge/Ko--fi-Support_Development-FF5E5B?logo=ko-fi&logoColor=white&style=for-the-badge" alt="Ko-fi"></a>
|
| 46 |
+
|
| 47 |
+
</div>
|
| 48 |
+
|
| 49 |
+
---
|
| 50 |
+
|
| 51 |
+
## What Is This?
|
| 52 |
+
|
| 53 |
+
This is [Gemma 4 12B-it](https://huggingface.co/google/gemma-4-12B-it) by Google — a unified omni-modal language model (text + image + audio + video, hybrid sliding/full attention, 48 layers, 128k context) that has been:
|
| 54 |
+
|
| 55 |
+
1. **CRACK abliterated** — safety refusal removed at the weight level. The model now complies across all task categories instead of refusing, while keeping its knowledge, reasoning, and multimodal capabilities intact.
|
| 56 |
+
2. **JANG mixed-precision (8-bit attention, 4-bit MLP) quantized** for MLX on Apple Silicon — 9.6 GB.
|
| 57 |
+
|
| 58 |
+
## Results
|
| 59 |
+
|
| 60 |
+
Evaluated through the Osaurus runtime on a Mac Studio M3 Ultra. Compliance graded via HarmBench-text refusal classifier; MMLU via logit-mode argmax over A/B/C/D token logits (matched on both base and CRACK with identical chat-template rendering — no answer truncated).
|
| 61 |
+
|
| 62 |
+
### HarmBench compliance (70 prompts · 10 per category)
|
| 63 |
+
|
| 64 |
+
| Category | CRACK ASR |
|
| 65 |
+
|---|---:|
|
| 66 |
+
| Chemical / biological | **10 / 10** (100%) |
|
| 67 |
+
| Copyright | **10 / 10** (100%) |
|
| 68 |
+
| Cybercrime / intrusion | **10 / 10** (100%) |
|
| 69 |
+
| Harassment / bullying | **10 / 10** (100%) |
|
| 70 |
+
| Illegal | **10 / 10** (100%) |
|
| 71 |
+
| Misinformation / disinformation | **10 / 10** (100%) |
|
| 72 |
+
| General harmful | **10 / 10** (100%) |
|
| 73 |
+
| **Overall** | **70 / 70 (100%)** |
|
| 74 |
+
|
| 75 |
+
### MMLU-228 (57-subject, 4 q per subject)
|
| 76 |
+
|
| 77 |
+
| Subject area | base | CRACK | Δ |
|
| 78 |
+
|---|---:|---:|---:|
|
| 79 |
+
| **Overall** | 67.1% | **69.3%** | +2.2pp |
|
| 80 |
+
| STEM | 68.1% | 66.7% | -1.4pp |
|
| 81 |
+
| Humanities | 57.7% | 63.5% | +5.8pp |
|
| 82 |
+
| Social Sciences | 75.0% | 75.0% | +0.0pp |
|
| 83 |
+
| Other (medicine, business, …) | 67.9% | 73.2% | +5.3pp |
|
| 84 |
+
|
| 85 |
+
## Features
|
| 86 |
+
|
| 87 |
+
- **Omni-modal**: native text + image + audio + video inputs (Gemma 4's unified early-fusion encoder-free architecture)
|
| 88 |
+
- **128k context** with hybrid sliding-window + full-attention layers
|
| 89 |
+
- **Reasoning** via the Gemma 4 channel-marker format (`<|channel>thought ... <channel|>`)
|
| 90 |
+
- **48 transformer layers**, hidden size 3840
|
| 91 |
+
|
| 92 |
+
## Usage
|
| 93 |
+
|
| 94 |
+
Run with the **Osaurus** runtime (recommended — supports the `gemma4_unified` architecture directly):
|
| 95 |
+
|
| 96 |
+
```python
|
| 97 |
+
# OpenAI-compatible chat completion
|
| 98 |
+
# POST /v1/chat/completions
|
| 99 |
+
{
|
| 100 |
+
"model": "dealignai/Gemma-4-12B-it-JANG_4M-CRACK",
|
| 101 |
+
"messages": [{"role": "user", "content": "..."}],
|
| 102 |
+
"temperature": 0.0
|
| 103 |
+
}
|
| 104 |
+
```
|
| 105 |
+
|
| 106 |
+
Google recommends `temperature=1.0, top_p=0.95, top_k=64` for general use.
|
| 107 |
+
|
| 108 |
+
## About CRACK
|
| 109 |
+
|
| 110 |
+
**CRACK** (Controlled Refusal Ablation via Calibrated Knockouts) removes safety-refusal behavior at the weight level so the model complies with all task categories while preserving reasoning quality, factual knowledge, multimodal capability, and coherence.
|
| 111 |
+
|
| 112 |
+
## Support dealignai
|
| 113 |
+
|
| 114 |
+
All models are built from original research and released free.
|
| 115 |
+
|
| 116 |
+
**[Support us on Ko-fi](https://ko-fi.com/dealignai)** — membership gets early access and extras.
|
| 117 |
+
|
| 118 |
+
[Ko-fi](https://ko-fi.com/dealignai) · [X @dealignai](https://x.com/dealignai) · [dealign.ai](https://dealign.ai)
|
| 119 |
+
|
| 120 |
+
See our research: [Safety Generalization in Frontier Models](https://dealign.ai/quantsteer.html)
|
| 121 |
+
|
| 122 |
+
<div align="center"><img src="dealign_logo.png" alt="dealign.ai" width="200"/></div>
|
| 123 |
+
|
| 124 |
+
---
|
| 125 |
+
|
| 126 |
+
## Disclaimer
|
| 127 |
+
|
| 128 |
+
This model has had its safety-refusal behavior removed for research purposes. It will follow instructions across all categories without refusing. You are solely responsible for how you use it and for complying with all applicable laws. Published for AI-safety research and authorized security testing.
|
chat_template.jinja
ADDED
|
@@ -0,0 +1,363 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{%- macro format_parameters(properties, required, filter_keys=false) -%}
|
| 2 |
+
{%- set standard_keys = ['description', 'type', 'properties', 'required', 'nullable'] -%}
|
| 3 |
+
{%- set ns = namespace(found_first=false) -%}
|
| 4 |
+
{%- for key, value in properties | dictsort -%}
|
| 5 |
+
{%- set add_comma = false -%}
|
| 6 |
+
{%- if not filter_keys or key not in standard_keys -%}
|
| 7 |
+
{%- if ns.found_first %},{% endif -%}
|
| 8 |
+
{%- set ns.found_first = true -%}
|
| 9 |
+
{{ key }}:{
|
| 10 |
+
{%- if value['description'] -%}
|
| 11 |
+
description:<|"|>{{ value['description'] }}<|"|>
|
| 12 |
+
{%- set add_comma = true -%}
|
| 13 |
+
{%- endif -%}
|
| 14 |
+
{%- if value['type'] | upper == 'STRING' -%}
|
| 15 |
+
{%- if value['enum'] -%}
|
| 16 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 17 |
+
enum:{{ format_argument(value['enum']) }}
|
| 18 |
+
{%- endif -%}
|
| 19 |
+
{%- elif value['type'] | upper == 'ARRAY' -%}
|
| 20 |
+
{%- if value['items'] is mapping and value['items'] -%}
|
| 21 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 22 |
+
items:{
|
| 23 |
+
{%- set ns_items = namespace(found_first=false) -%}
|
| 24 |
+
{%- for item_key, item_value in value['items'] | dictsort -%}
|
| 25 |
+
{%- if item_value is not none -%}
|
| 26 |
+
{%- if ns_items.found_first %},{% endif -%}
|
| 27 |
+
{%- set ns_items.found_first = true -%}
|
| 28 |
+
{%- if item_key == 'properties' -%}
|
| 29 |
+
properties:{
|
| 30 |
+
{%- if item_value is mapping -%}
|
| 31 |
+
{{- format_parameters(item_value, value['items']['required'] | default([])) -}}
|
| 32 |
+
{%- endif -%}
|
| 33 |
+
}
|
| 34 |
+
{%- elif item_key == 'required' -%}
|
| 35 |
+
required:[
|
| 36 |
+
{%- for req_item in item_value -%}
|
| 37 |
+
<|"|>{{- req_item -}}<|"|>
|
| 38 |
+
{%- if not loop.last %},{% endif -%}
|
| 39 |
+
{%- endfor -%}
|
| 40 |
+
]
|
| 41 |
+
{%- elif item_key == 'type' -%}
|
| 42 |
+
{%- if item_value is string -%}
|
| 43 |
+
type:{{ format_argument(item_value | upper) }}
|
| 44 |
+
{%- else -%}
|
| 45 |
+
type:{{ format_argument(item_value | map('upper') | list) }}
|
| 46 |
+
{%- endif -%}
|
| 47 |
+
{%- else -%}
|
| 48 |
+
{{ item_key }}:{{ format_argument(item_value) }}
|
| 49 |
+
{%- endif -%}
|
| 50 |
+
{%- endif -%}
|
| 51 |
+
{%- endfor -%}
|
| 52 |
+
}
|
| 53 |
+
{%- endif -%}
|
| 54 |
+
{%- endif -%}
|
| 55 |
+
{%- if value['nullable'] %}
|
| 56 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 57 |
+
nullable:true
|
| 58 |
+
{%- endif -%}
|
| 59 |
+
{%- if value['type'] | upper == 'OBJECT' -%}
|
| 60 |
+
{%- if value['properties'] is defined and value['properties'] is mapping -%}
|
| 61 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 62 |
+
properties:{
|
| 63 |
+
{{- format_parameters(value['properties'], value['required'] | default([])) -}}
|
| 64 |
+
}
|
| 65 |
+
{%- elif value is mapping -%}
|
| 66 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 67 |
+
properties:{
|
| 68 |
+
{{- format_parameters(value, value['required'] | default([]), filter_keys=true) -}}
|
| 69 |
+
}
|
| 70 |
+
{%- endif -%}
|
| 71 |
+
{%- if value['required'] -%}
|
| 72 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 73 |
+
required:[
|
| 74 |
+
{%- for item in value['required'] | default([]) -%}
|
| 75 |
+
<|"|>{{- item -}}<|"|>
|
| 76 |
+
{%- if not loop.last %},{% endif -%}
|
| 77 |
+
{%- endfor -%}
|
| 78 |
+
]
|
| 79 |
+
{%- endif -%}
|
| 80 |
+
{%- endif -%}
|
| 81 |
+
{%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}
|
| 82 |
+
type:<|"|>{{ value['type'] | upper }}<|"|>}
|
| 83 |
+
{%- endif -%}
|
| 84 |
+
{%- endfor -%}
|
| 85 |
+
{%- endmacro -%}
|
| 86 |
+
{%- macro format_function_declaration(tool_data) -%}
|
| 87 |
+
declaration:{{- tool_data['function']['name'] -}}{description:<|"|>{{- tool_data['function']['description'] -}}<|"|>
|
| 88 |
+
{%- set params = tool_data['function']['parameters'] -%}
|
| 89 |
+
{%- if params -%}
|
| 90 |
+
,parameters:{
|
| 91 |
+
{%- if params['properties'] -%}
|
| 92 |
+
properties:{ {{- format_parameters(params['properties'], params['required']) -}} },
|
| 93 |
+
{%- endif -%}
|
| 94 |
+
{%- if params['required'] -%}
|
| 95 |
+
required:[
|
| 96 |
+
{%- for item in params['required'] -%}
|
| 97 |
+
<|"|>{{- item -}}<|"|>
|
| 98 |
+
{{- ',' if not loop.last -}}
|
| 99 |
+
{%- endfor -%}
|
| 100 |
+
],
|
| 101 |
+
{%- endif -%}
|
| 102 |
+
{%- if params['type'] -%}
|
| 103 |
+
type:<|"|>{{- params['type'] | upper -}}<|"|>}
|
| 104 |
+
{%- endif -%}
|
| 105 |
+
{%- endif -%}
|
| 106 |
+
{%- if 'response' in tool_data['function'] -%}
|
| 107 |
+
{%- set response_declaration = tool_data['function']['response'] -%}
|
| 108 |
+
,response:{
|
| 109 |
+
{%- if response_declaration['description'] -%}
|
| 110 |
+
description:<|"|>{{- response_declaration['description'] -}}<|"|>,
|
| 111 |
+
{%- endif -%}
|
| 112 |
+
{%- if response_declaration['type'] | upper == 'OBJECT' -%}
|
| 113 |
+
type:<|"|>{{- response_declaration['type'] | upper -}}<|"|>}
|
| 114 |
+
{%- endif -%}
|
| 115 |
+
{%- endif -%}
|
| 116 |
+
}
|
| 117 |
+
{%- endmacro -%}
|
| 118 |
+
{%- macro format_argument(argument, escape_keys=True) -%}
|
| 119 |
+
{%- if argument is string -%}
|
| 120 |
+
{{- '<|"|>' + argument + '<|"|>' -}}
|
| 121 |
+
{%- elif argument is boolean -%}
|
| 122 |
+
{{- 'true' if argument else 'false' -}}
|
| 123 |
+
{%- elif argument is mapping -%}
|
| 124 |
+
{{- '{' -}}
|
| 125 |
+
{%- set ns = namespace(found_first=false) -%}
|
| 126 |
+
{%- for key, value in argument | dictsort -%}
|
| 127 |
+
{%- if ns.found_first %},{% endif -%}
|
| 128 |
+
{%- set ns.found_first = true -%}
|
| 129 |
+
{%- if escape_keys -%}
|
| 130 |
+
{{- '<|"|>' + key + '<|"|>' -}}
|
| 131 |
+
{%- else -%}
|
| 132 |
+
{{- key -}}
|
| 133 |
+
{%- endif -%}
|
| 134 |
+
:{{- format_argument(value, escape_keys=escape_keys) -}}
|
| 135 |
+
{%- endfor -%}
|
| 136 |
+
{{- '}' -}}
|
| 137 |
+
{%- elif argument is sequence -%}
|
| 138 |
+
{{- '[' -}}
|
| 139 |
+
{%- for item in argument -%}
|
| 140 |
+
{{- format_argument(item, escape_keys=escape_keys) -}}
|
| 141 |
+
{%- if not loop.last %},{% endif -%}
|
| 142 |
+
{%- endfor -%}
|
| 143 |
+
{{- ']' -}}
|
| 144 |
+
{%- else -%}
|
| 145 |
+
{{- argument -}}
|
| 146 |
+
{%- endif -%}
|
| 147 |
+
{%- endmacro -%}
|
| 148 |
+
{%- macro strip_thinking(text) -%}
|
| 149 |
+
{%- set ns = namespace(result='') -%}
|
| 150 |
+
{%- for part in text.split('<channel|>') -%}
|
| 151 |
+
{%- if '<|channel>' in part -%}
|
| 152 |
+
{%- set ns.result = ns.result + part.split('<|channel>')[0] -%}
|
| 153 |
+
{%- else -%}
|
| 154 |
+
{%- set ns.result = ns.result + part -%}
|
| 155 |
+
{%- endif -%}
|
| 156 |
+
{%- endfor -%}
|
| 157 |
+
{{- ns.result | trim -}}
|
| 158 |
+
{%- endmacro -%}
|
| 159 |
+
|
| 160 |
+
{%- macro format_tool_response_block(tool_name, response) -%}
|
| 161 |
+
{{- '<|tool_response>' -}}
|
| 162 |
+
{%- if response is mapping -%}
|
| 163 |
+
{{- 'response:' + tool_name + '{' -}}
|
| 164 |
+
{%- for key, value in response | dictsort -%}
|
| 165 |
+
{{- key -}}:{{- format_argument(value, escape_keys=False) -}}
|
| 166 |
+
{%- if not loop.last %},{% endif -%}
|
| 167 |
+
{%- endfor -%}
|
| 168 |
+
{{- '}' -}}
|
| 169 |
+
{%- else -%}
|
| 170 |
+
{{- 'response:' + tool_name + '{value:' + format_argument(response, escape_keys=False) + '}' -}}
|
| 171 |
+
{%- endif -%}
|
| 172 |
+
{{- '<tool_response|>' -}}
|
| 173 |
+
{%- endmacro -%}
|
| 174 |
+
|
| 175 |
+
{%- set ns = namespace(prev_message_type=None) -%}
|
| 176 |
+
{%- set loop_messages = messages -%}
|
| 177 |
+
{{- bos_token -}}
|
| 178 |
+
{#- Handle System/Tool Definitions Block -#}
|
| 179 |
+
{%- if (enable_thinking is defined and enable_thinking) or tools or messages[0]['role'] in ['system', 'developer'] -%}
|
| 180 |
+
{{- '<|turn>system\n' -}}
|
| 181 |
+
{#- Inject Thinking token at the very top of the FIRST system turn -#}
|
| 182 |
+
{%- if enable_thinking is defined and enable_thinking -%}
|
| 183 |
+
{{- '<|think|>\n' -}}
|
| 184 |
+
{%- set ns.prev_message_type = 'think' -%}
|
| 185 |
+
{%- endif -%}
|
| 186 |
+
{%- if messages[0]['role'] in ['system', 'developer'] -%}
|
| 187 |
+
{%- if messages[0]['content'] is string -%}
|
| 188 |
+
{{- messages[0]['content'] | trim -}}
|
| 189 |
+
{%- elif messages[0]['content'] is sequence -%}
|
| 190 |
+
{%- for item in messages[0]['content'] -%}
|
| 191 |
+
{{- item['text'] | trim + ' '-}}
|
| 192 |
+
{%- endfor -%}
|
| 193 |
+
{%- endif -%}
|
| 194 |
+
{%- set loop_messages = messages[1:] -%}
|
| 195 |
+
{%- endif -%}
|
| 196 |
+
{%- if tools -%}
|
| 197 |
+
{%- for tool in tools %}
|
| 198 |
+
{{- '<|tool>' -}}
|
| 199 |
+
{{- format_function_declaration(tool) | trim -}}
|
| 200 |
+
{{- '<tool|>' -}}
|
| 201 |
+
{%- endfor %}
|
| 202 |
+
{%- set ns.prev_message_type = 'tool' -%}
|
| 203 |
+
{%- if tool_choice is defined and ((tool_choice is string and tool_choice == 'required') or (tool_choice is mapping and tool_choice.get('type') == 'required')) -%}
|
| 204 |
+
{{- '\nTool use is REQUIRED for this turn: you must call exactly one of the declared tools. Output only the tool call, and copy every argument value verbatim \u2014 character for character, preserving punctuation and newlines, with no added or removed whitespace.' -}}
|
| 205 |
+
{%- endif -%}
|
| 206 |
+
{%- endif -%}
|
| 207 |
+
{{- '<turn|>\n' -}}
|
| 208 |
+
{%- endif %}
|
| 209 |
+
|
| 210 |
+
{#- Pre-scan: find last user message index for reasoning guard -#}
|
| 211 |
+
{%- set ns_turn = namespace(last_user_idx=-1) -%}
|
| 212 |
+
{%- for i in range(loop_messages | length) -%}
|
| 213 |
+
{%- if loop_messages[i]['role'] == 'user' -%}
|
| 214 |
+
{%- set ns_turn.last_user_idx = i -%}
|
| 215 |
+
{%- endif -%}
|
| 216 |
+
{%- endfor -%}
|
| 217 |
+
|
| 218 |
+
{#- Loop through messages -#}
|
| 219 |
+
{%- for message in loop_messages -%}
|
| 220 |
+
{%- if message['role'] != 'tool' -%}
|
| 221 |
+
{%- set ns.prev_message_type = None -%}
|
| 222 |
+
{%- set role = 'model' if message['role'] == 'assistant' else message['role'] -%}
|
| 223 |
+
{#- Detect continuation: suppress duplicate <|turn>model when previous non-tool message was also assistant -#}
|
| 224 |
+
{%- set prev_nt = namespace(role=None, found=false) -%}
|
| 225 |
+
{%- if loop.index0 > 0 -%}
|
| 226 |
+
{%- for j in range(loop.index0 - 1, -1, -1) -%}
|
| 227 |
+
{%- if not prev_nt.found -%}
|
| 228 |
+
{%- if loop_messages[j]['role'] != 'tool' -%}
|
| 229 |
+
{%- set prev_nt.role = loop_messages[j]['role'] -%}
|
| 230 |
+
{%- set prev_nt.found = true -%}
|
| 231 |
+
{%- endif -%}
|
| 232 |
+
{%- endif -%}
|
| 233 |
+
{%- endfor -%}
|
| 234 |
+
{%- endif -%}
|
| 235 |
+
{%- set continue_same_model_turn = (role == 'model' and prev_nt.role == 'assistant') -%}
|
| 236 |
+
{%- if not continue_same_model_turn -%}
|
| 237 |
+
{{- '<|turn>' + role + '\n' }}
|
| 238 |
+
{%- endif -%}
|
| 239 |
+
|
| 240 |
+
{#- Render reasoning/reasoning_content as thinking channel -#}
|
| 241 |
+
{%- set thinking_text = message.get('reasoning') or message.get('reasoning_content') -%}
|
| 242 |
+
{%- if thinking_text and loop.index0 > ns_turn.last_user_idx and message.get('tool_calls') -%}
|
| 243 |
+
{{- '<|channel>thought\n' + thinking_text + '\n<channel|>' -}}
|
| 244 |
+
{%- endif -%}
|
| 245 |
+
|
| 246 |
+
{%- if message['tool_calls'] -%}
|
| 247 |
+
{%- for tool_call in message['tool_calls'] -%}
|
| 248 |
+
{%- set function = tool_call['function'] -%}
|
| 249 |
+
{{- '<|tool_call>call:' + function['name'] + '{' -}}
|
| 250 |
+
{%- if function['arguments'] is mapping -%}
|
| 251 |
+
{%- set ns_args = namespace(found_first=false) -%}
|
| 252 |
+
{%- for key, value in function['arguments'] | dictsort -%}
|
| 253 |
+
{%- if ns_args.found_first %},{% endif -%}
|
| 254 |
+
{%- set ns_args.found_first = true -%}
|
| 255 |
+
{{- key -}}:{{- format_argument(value, escape_keys=False) -}}
|
| 256 |
+
{%- endfor -%}
|
| 257 |
+
{%- elif function['arguments'] is string -%}
|
| 258 |
+
{{- function['arguments'] -}}
|
| 259 |
+
{%- endif -%}
|
| 260 |
+
{{- '}<tool_call|>' -}}
|
| 261 |
+
{%- endfor -%}
|
| 262 |
+
{%- set ns.prev_message_type = 'tool_call' -%}
|
| 263 |
+
{%- endif -%}
|
| 264 |
+
|
| 265 |
+
{%- set ns_tr_out = namespace(flag=false) -%}
|
| 266 |
+
{%- if message.get('tool_responses') -%}
|
| 267 |
+
{#- Legacy: tool_responses embedded on the assistant message (Google/Gemma native) -#}
|
| 268 |
+
{%- for tool_response in message['tool_responses'] -%}
|
| 269 |
+
{{- format_tool_response_block(tool_response['name'] | default('unknown'), tool_response['response']) -}}
|
| 270 |
+
{%- set ns_tr_out.flag = true -%}
|
| 271 |
+
{%- set ns.prev_message_type = 'tool_response' -%}
|
| 272 |
+
{%- endfor -%}
|
| 273 |
+
{%- elif message.get('tool_calls') -%}
|
| 274 |
+
{#- OpenAI Chat Completions: forward-scan consecutive role:tool messages -#}
|
| 275 |
+
{%- set ns_tool_scan = namespace(stopped=false) -%}
|
| 276 |
+
{%- for k in range(loop.index0 + 1, loop_messages | length) -%}
|
| 277 |
+
{%- if ns_tool_scan.stopped -%}
|
| 278 |
+
{%- elif loop_messages[k]['role'] != 'tool' -%}
|
| 279 |
+
{%- set ns_tool_scan.stopped = true -%}
|
| 280 |
+
{%- else -%}
|
| 281 |
+
{%- set follow = loop_messages[k] -%}
|
| 282 |
+
{#- Resolve tool_call_id to function name -#}
|
| 283 |
+
{%- set ns_tname = namespace(name=follow.get('name') | default('unknown')) -%}
|
| 284 |
+
{%- for tc in message['tool_calls'] -%}
|
| 285 |
+
{%- if tc.get('id') == follow.get('tool_call_id') -%}
|
| 286 |
+
{%- set ns_tname.name = tc['function']['name'] -%}
|
| 287 |
+
{%- endif -%}
|
| 288 |
+
{%- endfor -%}
|
| 289 |
+
{#- Handle content as string or content-parts array -#}
|
| 290 |
+
{%- set tool_body = follow.get('content') -%}
|
| 291 |
+
{%- if tool_body is string -%}
|
| 292 |
+
{{- format_tool_response_block(ns_tname.name, tool_body) -}}
|
| 293 |
+
{%- elif tool_body is sequence and tool_body is not string -%}
|
| 294 |
+
{%- set ns_txt = namespace(s='') -%}
|
| 295 |
+
{%- for part in tool_body -%}
|
| 296 |
+
{%- if part.get('type') == 'text' -%}
|
| 297 |
+
{%- set ns_txt.s = ns_txt.s + (part.get('text') | default('')) -%}
|
| 298 |
+
{%- endif -%}
|
| 299 |
+
{%- endfor -%}
|
| 300 |
+
{{- format_tool_response_block(ns_tname.name, ns_txt.s) -}}
|
| 301 |
+
{%- for part in tool_body -%}
|
| 302 |
+
{%- if part.get('type') == 'image' -%}
|
| 303 |
+
{{- '<|image|>' -}}
|
| 304 |
+
{%- elif part.get('type') == 'audio' -%}
|
| 305 |
+
{{- '<|audio|>' -}}
|
| 306 |
+
{%- elif part.get('type') == 'video' -%}
|
| 307 |
+
{{- '<|video|>' -}}
|
| 308 |
+
{%- endif -%}
|
| 309 |
+
{%- endfor -%}
|
| 310 |
+
{%- else -%}
|
| 311 |
+
{{- format_tool_response_block(ns_tname.name, tool_body) -}}
|
| 312 |
+
{%- endif -%}
|
| 313 |
+
{%- set ns_tr_out.flag = true -%}
|
| 314 |
+
{%- set ns.prev_message_type = 'tool_response' -%}
|
| 315 |
+
{%- endif -%}
|
| 316 |
+
{%- endfor -%}
|
| 317 |
+
{%- endif -%}
|
| 318 |
+
|
| 319 |
+
{%- set captured_content -%}
|
| 320 |
+
{%- if message['content'] is string -%}
|
| 321 |
+
{%- if role == 'model' -%}
|
| 322 |
+
{{- strip_thinking(message['content']) -}}
|
| 323 |
+
{%- else -%}
|
| 324 |
+
{{- message['content'] | trim -}}
|
| 325 |
+
{%- endif -%}
|
| 326 |
+
{%- elif message['content'] is sequence -%}
|
| 327 |
+
{%- for item in message['content'] -%}
|
| 328 |
+
{%- if item['type'] == 'text' -%}
|
| 329 |
+
{%- if role == 'model' -%}
|
| 330 |
+
{{- strip_thinking(item['text']) -}}
|
| 331 |
+
{%- else -%}
|
| 332 |
+
{{- item['text'] | trim -}}
|
| 333 |
+
{%- endif -%}
|
| 334 |
+
{%- elif item['type'] == 'image' -%}
|
| 335 |
+
{{- '<|image|>' -}}
|
| 336 |
+
{%- set ns.prev_message_type = 'image' -%}
|
| 337 |
+
{%- elif item['type'] == 'audio' -%}
|
| 338 |
+
{{- '<|audio|>' -}}
|
| 339 |
+
{%- set ns.prev_message_type = 'audio' -%}
|
| 340 |
+
{%- elif item['type'] == 'video' -%}
|
| 341 |
+
{{- '<|video|>' -}}
|
| 342 |
+
{%- set ns.prev_message_type = 'video' -%}
|
| 343 |
+
{%- endif -%}
|
| 344 |
+
{%- endfor -%}
|
| 345 |
+
{%- endif -%}
|
| 346 |
+
{%- endset -%}
|
| 347 |
+
|
| 348 |
+
{{- captured_content -}}
|
| 349 |
+
{%- set has_content = captured_content | trim | length > 0 -%}
|
| 350 |
+
|
| 351 |
+
{%- if ns.prev_message_type == 'tool_call' and not ns_tr_out.flag -%}
|
| 352 |
+
{{- '<|tool_response>' -}}
|
| 353 |
+
{%- elif not (ns_tr_out.flag and not has_content) -%}
|
| 354 |
+
{{- '<turn|>\n' -}}
|
| 355 |
+
{%- endif -%}
|
| 356 |
+
{%- endif -%}
|
| 357 |
+
{%- endfor -%}
|
| 358 |
+
|
| 359 |
+
{%- if add_generation_prompt -%}
|
| 360 |
+
{%- if ns.prev_message_type != 'tool_response' and ns.prev_message_type != 'tool_call' -%}
|
| 361 |
+
{{- '<|turn>model\n' -}}
|
| 362 |
+
{%- endif -%}
|
| 363 |
+
{%- endif -%}
|
config.json
ADDED
|
@@ -0,0 +1,909 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"Gemma4UnifiedForConditionalGeneration"
|
| 4 |
+
],
|
| 5 |
+
"audio_config": {
|
| 6 |
+
"_name_or_path": "",
|
| 7 |
+
"architectures": null,
|
| 8 |
+
"audio_embed_dim": 640,
|
| 9 |
+
"audio_samples_per_token": 640,
|
| 10 |
+
"chunk_size_feed_forward": 0,
|
| 11 |
+
"dtype": null,
|
| 12 |
+
"hidden_size": 640,
|
| 13 |
+
"id2label": {
|
| 14 |
+
"0": "LABEL_0",
|
| 15 |
+
"1": "LABEL_1"
|
| 16 |
+
},
|
| 17 |
+
"initializer_range": 0.02,
|
| 18 |
+
"is_encoder_decoder": false,
|
| 19 |
+
"label2id": {
|
| 20 |
+
"LABEL_0": 0,
|
| 21 |
+
"LABEL_1": 1
|
| 22 |
+
},
|
| 23 |
+
"model_type": "gemma4_unified_audio",
|
| 24 |
+
"output_attentions": false,
|
| 25 |
+
"output_hidden_states": false,
|
| 26 |
+
"output_proj_dims": 640,
|
| 27 |
+
"problem_type": null,
|
| 28 |
+
"return_dict": true,
|
| 29 |
+
"rms_norm_eps": 1e-06
|
| 30 |
+
},
|
| 31 |
+
"audio_token_id": 258881,
|
| 32 |
+
"boa_token_id": 256000,
|
| 33 |
+
"boi_token_id": 255999,
|
| 34 |
+
"dtype": "bfloat16",
|
| 35 |
+
"eoa_token_index": 258883,
|
| 36 |
+
"eoi_token_id": 258882,
|
| 37 |
+
"eos_token_id": [
|
| 38 |
+
1,
|
| 39 |
+
106
|
| 40 |
+
],
|
| 41 |
+
"image_token_id": 258880,
|
| 42 |
+
"initializer_range": 0.02,
|
| 43 |
+
"model_type": "gemma4_unified",
|
| 44 |
+
"text_config": {
|
| 45 |
+
"attention_bias": false,
|
| 46 |
+
"attention_dropout": 0.0,
|
| 47 |
+
"attention_k_eq_v": true,
|
| 48 |
+
"bos_token_id": 2,
|
| 49 |
+
"enable_moe_block": false,
|
| 50 |
+
"eos_token_id": 1,
|
| 51 |
+
"final_logit_softcapping": 30.0,
|
| 52 |
+
"global_head_dim": 512,
|
| 53 |
+
"head_dim": 256,
|
| 54 |
+
"hidden_activation": "gelu_pytorch_tanh",
|
| 55 |
+
"hidden_size": 3840,
|
| 56 |
+
"hidden_size_per_layer_input": 0,
|
| 57 |
+
"initializer_range": 0.02,
|
| 58 |
+
"intermediate_size": 15360,
|
| 59 |
+
"layer_types": [
|
| 60 |
+
"sliding_attention",
|
| 61 |
+
"sliding_attention",
|
| 62 |
+
"sliding_attention",
|
| 63 |
+
"sliding_attention",
|
| 64 |
+
"sliding_attention",
|
| 65 |
+
"full_attention",
|
| 66 |
+
"sliding_attention",
|
| 67 |
+
"sliding_attention",
|
| 68 |
+
"sliding_attention",
|
| 69 |
+
"sliding_attention",
|
| 70 |
+
"sliding_attention",
|
| 71 |
+
"full_attention",
|
| 72 |
+
"sliding_attention",
|
| 73 |
+
"sliding_attention",
|
| 74 |
+
"sliding_attention",
|
| 75 |
+
"sliding_attention",
|
| 76 |
+
"sliding_attention",
|
| 77 |
+
"full_attention",
|
| 78 |
+
"sliding_attention",
|
| 79 |
+
"sliding_attention",
|
| 80 |
+
"sliding_attention",
|
| 81 |
+
"sliding_attention",
|
| 82 |
+
"sliding_attention",
|
| 83 |
+
"full_attention",
|
| 84 |
+
"sliding_attention",
|
| 85 |
+
"sliding_attention",
|
| 86 |
+
"sliding_attention",
|
| 87 |
+
"sliding_attention",
|
| 88 |
+
"sliding_attention",
|
| 89 |
+
"full_attention",
|
| 90 |
+
"sliding_attention",
|
| 91 |
+
"sliding_attention",
|
| 92 |
+
"sliding_attention",
|
| 93 |
+
"sliding_attention",
|
| 94 |
+
"sliding_attention",
|
| 95 |
+
"full_attention",
|
| 96 |
+
"sliding_attention",
|
| 97 |
+
"sliding_attention",
|
| 98 |
+
"sliding_attention",
|
| 99 |
+
"sliding_attention",
|
| 100 |
+
"sliding_attention",
|
| 101 |
+
"full_attention",
|
| 102 |
+
"sliding_attention",
|
| 103 |
+
"sliding_attention",
|
| 104 |
+
"sliding_attention",
|
| 105 |
+
"sliding_attention",
|
| 106 |
+
"sliding_attention",
|
| 107 |
+
"full_attention"
|
| 108 |
+
],
|
| 109 |
+
"max_position_embeddings": 131072,
|
| 110 |
+
"model_type": "gemma4_unified_text",
|
| 111 |
+
"moe_intermediate_size": null,
|
| 112 |
+
"num_attention_heads": 16,
|
| 113 |
+
"num_experts": null,
|
| 114 |
+
"num_global_key_value_heads": 1,
|
| 115 |
+
"num_hidden_layers": 48,
|
| 116 |
+
"num_key_value_heads": 8,
|
| 117 |
+
"num_kv_shared_layers": 0,
|
| 118 |
+
"pad_token_id": 0,
|
| 119 |
+
"rms_norm_eps": 1e-06,
|
| 120 |
+
"rope_parameters": {
|
| 121 |
+
"full_attention": {
|
| 122 |
+
"partial_rotary_factor": 0.25,
|
| 123 |
+
"rope_theta": 1000000.0,
|
| 124 |
+
"rope_type": "proportional"
|
| 125 |
+
},
|
| 126 |
+
"sliding_attention": {
|
| 127 |
+
"rope_theta": 10000.0,
|
| 128 |
+
"rope_type": "default"
|
| 129 |
+
}
|
| 130 |
+
},
|
| 131 |
+
"sliding_window": 1024,
|
| 132 |
+
"tie_word_embeddings": true,
|
| 133 |
+
"top_k_experts": null,
|
| 134 |
+
"use_bidirectional_attention": "vision",
|
| 135 |
+
"use_cache": true,
|
| 136 |
+
"use_double_wide_mlp": false,
|
| 137 |
+
"vocab_size": 262144,
|
| 138 |
+
"vocab_size_per_layer_input": 262144
|
| 139 |
+
},
|
| 140 |
+
"tie_word_embeddings": true,
|
| 141 |
+
"transformers_version": "5.10.0.dev0",
|
| 142 |
+
"video_token_id": 258884,
|
| 143 |
+
"vision_config": {
|
| 144 |
+
"_name_or_path": "",
|
| 145 |
+
"architectures": null,
|
| 146 |
+
"chunk_size_feed_forward": 0,
|
| 147 |
+
"dtype": null,
|
| 148 |
+
"id2label": {
|
| 149 |
+
"0": "LABEL_0",
|
| 150 |
+
"1": "LABEL_1"
|
| 151 |
+
},
|
| 152 |
+
"initializer_range": 0.02,
|
| 153 |
+
"is_encoder_decoder": false,
|
| 154 |
+
"label2id": {
|
| 155 |
+
"LABEL_0": 0,
|
| 156 |
+
"LABEL_1": 1
|
| 157 |
+
},
|
| 158 |
+
"mm_embed_dim": 3840,
|
| 159 |
+
"mm_posemb_size": 1120,
|
| 160 |
+
"model_patch_size": 48,
|
| 161 |
+
"model_type": "gemma4_unified_vision",
|
| 162 |
+
"num_soft_tokens": 280,
|
| 163 |
+
"output_attentions": false,
|
| 164 |
+
"output_hidden_states": false,
|
| 165 |
+
"output_proj_dims": 3840,
|
| 166 |
+
"patch_size": 16,
|
| 167 |
+
"pooling_kernel_size": 3,
|
| 168 |
+
"problem_type": null,
|
| 169 |
+
"return_dict": true,
|
| 170 |
+
"rms_norm_eps": 1e-06
|
| 171 |
+
},
|
| 172 |
+
"weight_format": "jang_affine",
|
| 173 |
+
"quantization": {
|
| 174 |
+
"group_size": 32,
|
| 175 |
+
"bits": 8,
|
| 176 |
+
"mode": "affine",
|
| 177 |
+
"tied_embedding": "fp16_passthrough",
|
| 178 |
+
"language_model.model.layers.0.mlp.down_proj": {
|
| 179 |
+
"group_size": 32,
|
| 180 |
+
"bits": 4,
|
| 181 |
+
"mode": "affine"
|
| 182 |
+
},
|
| 183 |
+
"language_model.model.layers.0.mlp.gate_proj": {
|
| 184 |
+
"group_size": 32,
|
| 185 |
+
"bits": 4,
|
| 186 |
+
"mode": "affine"
|
| 187 |
+
},
|
| 188 |
+
"language_model.model.layers.0.mlp.up_proj": {
|
| 189 |
+
"group_size": 32,
|
| 190 |
+
"bits": 4,
|
| 191 |
+
"mode": "affine"
|
| 192 |
+
},
|
| 193 |
+
"language_model.model.layers.1.mlp.down_proj": {
|
| 194 |
+
"group_size": 32,
|
| 195 |
+
"bits": 4,
|
| 196 |
+
"mode": "affine"
|
| 197 |
+
},
|
| 198 |
+
"language_model.model.layers.1.mlp.gate_proj": {
|
| 199 |
+
"group_size": 32,
|
| 200 |
+
"bits": 4,
|
| 201 |
+
"mode": "affine"
|
| 202 |
+
},
|
| 203 |
+
"language_model.model.layers.1.mlp.up_proj": {
|
| 204 |
+
"group_size": 32,
|
| 205 |
+
"bits": 4,
|
| 206 |
+
"mode": "affine"
|
| 207 |
+
},
|
| 208 |
+
"language_model.model.layers.10.mlp.down_proj": {
|
| 209 |
+
"group_size": 32,
|
| 210 |
+
"bits": 4,
|
| 211 |
+
"mode": "affine"
|
| 212 |
+
},
|
| 213 |
+
"language_model.model.layers.10.mlp.gate_proj": {
|
| 214 |
+
"group_size": 32,
|
| 215 |
+
"bits": 4,
|
| 216 |
+
"mode": "affine"
|
| 217 |
+
},
|
| 218 |
+
"language_model.model.layers.10.mlp.up_proj": {
|
| 219 |
+
"group_size": 32,
|
| 220 |
+
"bits": 4,
|
| 221 |
+
"mode": "affine"
|
| 222 |
+
},
|
| 223 |
+
"language_model.model.layers.11.mlp.down_proj": {
|
| 224 |
+
"group_size": 32,
|
| 225 |
+
"bits": 4,
|
| 226 |
+
"mode": "affine"
|
| 227 |
+
},
|
| 228 |
+
"language_model.model.layers.11.mlp.gate_proj": {
|
| 229 |
+
"group_size": 32,
|
| 230 |
+
"bits": 4,
|
| 231 |
+
"mode": "affine"
|
| 232 |
+
},
|
| 233 |
+
"language_model.model.layers.11.mlp.up_proj": {
|
| 234 |
+
"group_size": 32,
|
| 235 |
+
"bits": 4,
|
| 236 |
+
"mode": "affine"
|
| 237 |
+
},
|
| 238 |
+
"language_model.model.layers.12.mlp.down_proj": {
|
| 239 |
+
"group_size": 32,
|
| 240 |
+
"bits": 4,
|
| 241 |
+
"mode": "affine"
|
| 242 |
+
},
|
| 243 |
+
"language_model.model.layers.12.mlp.gate_proj": {
|
| 244 |
+
"group_size": 32,
|
| 245 |
+
"bits": 4,
|
| 246 |
+
"mode": "affine"
|
| 247 |
+
},
|
| 248 |
+
"language_model.model.layers.12.mlp.up_proj": {
|
| 249 |
+
"group_size": 32,
|
| 250 |
+
"bits": 4,
|
| 251 |
+
"mode": "affine"
|
| 252 |
+
},
|
| 253 |
+
"language_model.model.layers.13.mlp.down_proj": {
|
| 254 |
+
"group_size": 32,
|
| 255 |
+
"bits": 4,
|
| 256 |
+
"mode": "affine"
|
| 257 |
+
},
|
| 258 |
+
"language_model.model.layers.13.mlp.gate_proj": {
|
| 259 |
+
"group_size": 32,
|
| 260 |
+
"bits": 4,
|
| 261 |
+
"mode": "affine"
|
| 262 |
+
},
|
| 263 |
+
"language_model.model.layers.13.mlp.up_proj": {
|
| 264 |
+
"group_size": 32,
|
| 265 |
+
"bits": 4,
|
| 266 |
+
"mode": "affine"
|
| 267 |
+
},
|
| 268 |
+
"language_model.model.layers.14.mlp.down_proj": {
|
| 269 |
+
"group_size": 32,
|
| 270 |
+
"bits": 4,
|
| 271 |
+
"mode": "affine"
|
| 272 |
+
},
|
| 273 |
+
"language_model.model.layers.14.mlp.gate_proj": {
|
| 274 |
+
"group_size": 32,
|
| 275 |
+
"bits": 4,
|
| 276 |
+
"mode": "affine"
|
| 277 |
+
},
|
| 278 |
+
"language_model.model.layers.14.mlp.up_proj": {
|
| 279 |
+
"group_size": 32,
|
| 280 |
+
"bits": 4,
|
| 281 |
+
"mode": "affine"
|
| 282 |
+
},
|
| 283 |
+
"language_model.model.layers.15.mlp.down_proj": {
|
| 284 |
+
"group_size": 32,
|
| 285 |
+
"bits": 4,
|
| 286 |
+
"mode": "affine"
|
| 287 |
+
},
|
| 288 |
+
"language_model.model.layers.15.mlp.gate_proj": {
|
| 289 |
+
"group_size": 32,
|
| 290 |
+
"bits": 4,
|
| 291 |
+
"mode": "affine"
|
| 292 |
+
},
|
| 293 |
+
"language_model.model.layers.15.mlp.up_proj": {
|
| 294 |
+
"group_size": 32,
|
| 295 |
+
"bits": 4,
|
| 296 |
+
"mode": "affine"
|
| 297 |
+
},
|
| 298 |
+
"language_model.model.layers.16.mlp.down_proj": {
|
| 299 |
+
"group_size": 32,
|
| 300 |
+
"bits": 4,
|
| 301 |
+
"mode": "affine"
|
| 302 |
+
},
|
| 303 |
+
"language_model.model.layers.16.mlp.gate_proj": {
|
| 304 |
+
"group_size": 32,
|
| 305 |
+
"bits": 4,
|
| 306 |
+
"mode": "affine"
|
| 307 |
+
},
|
| 308 |
+
"language_model.model.layers.16.mlp.up_proj": {
|
| 309 |
+
"group_size": 32,
|
| 310 |
+
"bits": 4,
|
| 311 |
+
"mode": "affine"
|
| 312 |
+
},
|
| 313 |
+
"language_model.model.layers.17.mlp.down_proj": {
|
| 314 |
+
"group_size": 32,
|
| 315 |
+
"bits": 4,
|
| 316 |
+
"mode": "affine"
|
| 317 |
+
},
|
| 318 |
+
"language_model.model.layers.17.mlp.gate_proj": {
|
| 319 |
+
"group_size": 32,
|
| 320 |
+
"bits": 4,
|
| 321 |
+
"mode": "affine"
|
| 322 |
+
},
|
| 323 |
+
"language_model.model.layers.17.mlp.up_proj": {
|
| 324 |
+
"group_size": 32,
|
| 325 |
+
"bits": 4,
|
| 326 |
+
"mode": "affine"
|
| 327 |
+
},
|
| 328 |
+
"language_model.model.layers.18.mlp.down_proj": {
|
| 329 |
+
"group_size": 32,
|
| 330 |
+
"bits": 4,
|
| 331 |
+
"mode": "affine"
|
| 332 |
+
},
|
| 333 |
+
"language_model.model.layers.18.mlp.gate_proj": {
|
| 334 |
+
"group_size": 32,
|
| 335 |
+
"bits": 4,
|
| 336 |
+
"mode": "affine"
|
| 337 |
+
},
|
| 338 |
+
"language_model.model.layers.18.mlp.up_proj": {
|
| 339 |
+
"group_size": 32,
|
| 340 |
+
"bits": 4,
|
| 341 |
+
"mode": "affine"
|
| 342 |
+
},
|
| 343 |
+
"language_model.model.layers.19.mlp.down_proj": {
|
| 344 |
+
"group_size": 32,
|
| 345 |
+
"bits": 4,
|
| 346 |
+
"mode": "affine"
|
| 347 |
+
},
|
| 348 |
+
"language_model.model.layers.19.mlp.gate_proj": {
|
| 349 |
+
"group_size": 32,
|
| 350 |
+
"bits": 4,
|
| 351 |
+
"mode": "affine"
|
| 352 |
+
},
|
| 353 |
+
"language_model.model.layers.19.mlp.up_proj": {
|
| 354 |
+
"group_size": 32,
|
| 355 |
+
"bits": 4,
|
| 356 |
+
"mode": "affine"
|
| 357 |
+
},
|
| 358 |
+
"language_model.model.layers.2.mlp.down_proj": {
|
| 359 |
+
"group_size": 32,
|
| 360 |
+
"bits": 4,
|
| 361 |
+
"mode": "affine"
|
| 362 |
+
},
|
| 363 |
+
"language_model.model.layers.2.mlp.gate_proj": {
|
| 364 |
+
"group_size": 32,
|
| 365 |
+
"bits": 4,
|
| 366 |
+
"mode": "affine"
|
| 367 |
+
},
|
| 368 |
+
"language_model.model.layers.2.mlp.up_proj": {
|
| 369 |
+
"group_size": 32,
|
| 370 |
+
"bits": 4,
|
| 371 |
+
"mode": "affine"
|
| 372 |
+
},
|
| 373 |
+
"language_model.model.layers.20.mlp.down_proj": {
|
| 374 |
+
"group_size": 32,
|
| 375 |
+
"bits": 4,
|
| 376 |
+
"mode": "affine"
|
| 377 |
+
},
|
| 378 |
+
"language_model.model.layers.20.mlp.gate_proj": {
|
| 379 |
+
"group_size": 32,
|
| 380 |
+
"bits": 4,
|
| 381 |
+
"mode": "affine"
|
| 382 |
+
},
|
| 383 |
+
"language_model.model.layers.20.mlp.up_proj": {
|
| 384 |
+
"group_size": 32,
|
| 385 |
+
"bits": 4,
|
| 386 |
+
"mode": "affine"
|
| 387 |
+
},
|
| 388 |
+
"language_model.model.layers.21.mlp.down_proj": {
|
| 389 |
+
"group_size": 32,
|
| 390 |
+
"bits": 4,
|
| 391 |
+
"mode": "affine"
|
| 392 |
+
},
|
| 393 |
+
"language_model.model.layers.21.mlp.gate_proj": {
|
| 394 |
+
"group_size": 32,
|
| 395 |
+
"bits": 4,
|
| 396 |
+
"mode": "affine"
|
| 397 |
+
},
|
| 398 |
+
"language_model.model.layers.21.mlp.up_proj": {
|
| 399 |
+
"group_size": 32,
|
| 400 |
+
"bits": 4,
|
| 401 |
+
"mode": "affine"
|
| 402 |
+
},
|
| 403 |
+
"language_model.model.layers.22.mlp.down_proj": {
|
| 404 |
+
"group_size": 32,
|
| 405 |
+
"bits": 4,
|
| 406 |
+
"mode": "affine"
|
| 407 |
+
},
|
| 408 |
+
"language_model.model.layers.22.mlp.gate_proj": {
|
| 409 |
+
"group_size": 32,
|
| 410 |
+
"bits": 4,
|
| 411 |
+
"mode": "affine"
|
| 412 |
+
},
|
| 413 |
+
"language_model.model.layers.22.mlp.up_proj": {
|
| 414 |
+
"group_size": 32,
|
| 415 |
+
"bits": 4,
|
| 416 |
+
"mode": "affine"
|
| 417 |
+
},
|
| 418 |
+
"language_model.model.layers.23.mlp.down_proj": {
|
| 419 |
+
"group_size": 32,
|
| 420 |
+
"bits": 4,
|
| 421 |
+
"mode": "affine"
|
| 422 |
+
},
|
| 423 |
+
"language_model.model.layers.23.mlp.gate_proj": {
|
| 424 |
+
"group_size": 32,
|
| 425 |
+
"bits": 4,
|
| 426 |
+
"mode": "affine"
|
| 427 |
+
},
|
| 428 |
+
"language_model.model.layers.23.mlp.up_proj": {
|
| 429 |
+
"group_size": 32,
|
| 430 |
+
"bits": 4,
|
| 431 |
+
"mode": "affine"
|
| 432 |
+
},
|
| 433 |
+
"language_model.model.layers.24.mlp.down_proj": {
|
| 434 |
+
"group_size": 32,
|
| 435 |
+
"bits": 4,
|
| 436 |
+
"mode": "affine"
|
| 437 |
+
},
|
| 438 |
+
"language_model.model.layers.24.mlp.gate_proj": {
|
| 439 |
+
"group_size": 32,
|
| 440 |
+
"bits": 4,
|
| 441 |
+
"mode": "affine"
|
| 442 |
+
},
|
| 443 |
+
"language_model.model.layers.24.mlp.up_proj": {
|
| 444 |
+
"group_size": 32,
|
| 445 |
+
"bits": 4,
|
| 446 |
+
"mode": "affine"
|
| 447 |
+
},
|
| 448 |
+
"language_model.model.layers.25.mlp.down_proj": {
|
| 449 |
+
"group_size": 32,
|
| 450 |
+
"bits": 4,
|
| 451 |
+
"mode": "affine"
|
| 452 |
+
},
|
| 453 |
+
"language_model.model.layers.25.mlp.gate_proj": {
|
| 454 |
+
"group_size": 32,
|
| 455 |
+
"bits": 4,
|
| 456 |
+
"mode": "affine"
|
| 457 |
+
},
|
| 458 |
+
"language_model.model.layers.25.mlp.up_proj": {
|
| 459 |
+
"group_size": 32,
|
| 460 |
+
"bits": 4,
|
| 461 |
+
"mode": "affine"
|
| 462 |
+
},
|
| 463 |
+
"language_model.model.layers.26.mlp.down_proj": {
|
| 464 |
+
"group_size": 32,
|
| 465 |
+
"bits": 4,
|
| 466 |
+
"mode": "affine"
|
| 467 |
+
},
|
| 468 |
+
"language_model.model.layers.26.mlp.gate_proj": {
|
| 469 |
+
"group_size": 32,
|
| 470 |
+
"bits": 4,
|
| 471 |
+
"mode": "affine"
|
| 472 |
+
},
|
| 473 |
+
"language_model.model.layers.26.mlp.up_proj": {
|
| 474 |
+
"group_size": 32,
|
| 475 |
+
"bits": 4,
|
| 476 |
+
"mode": "affine"
|
| 477 |
+
},
|
| 478 |
+
"language_model.model.layers.27.mlp.down_proj": {
|
| 479 |
+
"group_size": 32,
|
| 480 |
+
"bits": 4,
|
| 481 |
+
"mode": "affine"
|
| 482 |
+
},
|
| 483 |
+
"language_model.model.layers.27.mlp.gate_proj": {
|
| 484 |
+
"group_size": 32,
|
| 485 |
+
"bits": 4,
|
| 486 |
+
"mode": "affine"
|
| 487 |
+
},
|
| 488 |
+
"language_model.model.layers.27.mlp.up_proj": {
|
| 489 |
+
"group_size": 32,
|
| 490 |
+
"bits": 4,
|
| 491 |
+
"mode": "affine"
|
| 492 |
+
},
|
| 493 |
+
"language_model.model.layers.28.mlp.down_proj": {
|
| 494 |
+
"group_size": 32,
|
| 495 |
+
"bits": 4,
|
| 496 |
+
"mode": "affine"
|
| 497 |
+
},
|
| 498 |
+
"language_model.model.layers.28.mlp.gate_proj": {
|
| 499 |
+
"group_size": 32,
|
| 500 |
+
"bits": 4,
|
| 501 |
+
"mode": "affine"
|
| 502 |
+
},
|
| 503 |
+
"language_model.model.layers.28.mlp.up_proj": {
|
| 504 |
+
"group_size": 32,
|
| 505 |
+
"bits": 4,
|
| 506 |
+
"mode": "affine"
|
| 507 |
+
},
|
| 508 |
+
"language_model.model.layers.29.mlp.down_proj": {
|
| 509 |
+
"group_size": 32,
|
| 510 |
+
"bits": 4,
|
| 511 |
+
"mode": "affine"
|
| 512 |
+
},
|
| 513 |
+
"language_model.model.layers.29.mlp.gate_proj": {
|
| 514 |
+
"group_size": 32,
|
| 515 |
+
"bits": 4,
|
| 516 |
+
"mode": "affine"
|
| 517 |
+
},
|
| 518 |
+
"language_model.model.layers.29.mlp.up_proj": {
|
| 519 |
+
"group_size": 32,
|
| 520 |
+
"bits": 4,
|
| 521 |
+
"mode": "affine"
|
| 522 |
+
},
|
| 523 |
+
"language_model.model.layers.3.mlp.down_proj": {
|
| 524 |
+
"group_size": 32,
|
| 525 |
+
"bits": 4,
|
| 526 |
+
"mode": "affine"
|
| 527 |
+
},
|
| 528 |
+
"language_model.model.layers.3.mlp.gate_proj": {
|
| 529 |
+
"group_size": 32,
|
| 530 |
+
"bits": 4,
|
| 531 |
+
"mode": "affine"
|
| 532 |
+
},
|
| 533 |
+
"language_model.model.layers.3.mlp.up_proj": {
|
| 534 |
+
"group_size": 32,
|
| 535 |
+
"bits": 4,
|
| 536 |
+
"mode": "affine"
|
| 537 |
+
},
|
| 538 |
+
"language_model.model.layers.30.mlp.down_proj": {
|
| 539 |
+
"group_size": 32,
|
| 540 |
+
"bits": 4,
|
| 541 |
+
"mode": "affine"
|
| 542 |
+
},
|
| 543 |
+
"language_model.model.layers.30.mlp.gate_proj": {
|
| 544 |
+
"group_size": 32,
|
| 545 |
+
"bits": 4,
|
| 546 |
+
"mode": "affine"
|
| 547 |
+
},
|
| 548 |
+
"language_model.model.layers.30.mlp.up_proj": {
|
| 549 |
+
"group_size": 32,
|
| 550 |
+
"bits": 4,
|
| 551 |
+
"mode": "affine"
|
| 552 |
+
},
|
| 553 |
+
"language_model.model.layers.31.mlp.down_proj": {
|
| 554 |
+
"group_size": 32,
|
| 555 |
+
"bits": 4,
|
| 556 |
+
"mode": "affine"
|
| 557 |
+
},
|
| 558 |
+
"language_model.model.layers.31.mlp.gate_proj": {
|
| 559 |
+
"group_size": 32,
|
| 560 |
+
"bits": 4,
|
| 561 |
+
"mode": "affine"
|
| 562 |
+
},
|
| 563 |
+
"language_model.model.layers.31.mlp.up_proj": {
|
| 564 |
+
"group_size": 32,
|
| 565 |
+
"bits": 4,
|
| 566 |
+
"mode": "affine"
|
| 567 |
+
},
|
| 568 |
+
"language_model.model.layers.32.mlp.down_proj": {
|
| 569 |
+
"group_size": 32,
|
| 570 |
+
"bits": 4,
|
| 571 |
+
"mode": "affine"
|
| 572 |
+
},
|
| 573 |
+
"language_model.model.layers.32.mlp.gate_proj": {
|
| 574 |
+
"group_size": 32,
|
| 575 |
+
"bits": 4,
|
| 576 |
+
"mode": "affine"
|
| 577 |
+
},
|
| 578 |
+
"language_model.model.layers.32.mlp.up_proj": {
|
| 579 |
+
"group_size": 32,
|
| 580 |
+
"bits": 4,
|
| 581 |
+
"mode": "affine"
|
| 582 |
+
},
|
| 583 |
+
"language_model.model.layers.33.mlp.down_proj": {
|
| 584 |
+
"group_size": 32,
|
| 585 |
+
"bits": 4,
|
| 586 |
+
"mode": "affine"
|
| 587 |
+
},
|
| 588 |
+
"language_model.model.layers.33.mlp.gate_proj": {
|
| 589 |
+
"group_size": 32,
|
| 590 |
+
"bits": 4,
|
| 591 |
+
"mode": "affine"
|
| 592 |
+
},
|
| 593 |
+
"language_model.model.layers.33.mlp.up_proj": {
|
| 594 |
+
"group_size": 32,
|
| 595 |
+
"bits": 4,
|
| 596 |
+
"mode": "affine"
|
| 597 |
+
},
|
| 598 |
+
"language_model.model.layers.34.mlp.down_proj": {
|
| 599 |
+
"group_size": 32,
|
| 600 |
+
"bits": 4,
|
| 601 |
+
"mode": "affine"
|
| 602 |
+
},
|
| 603 |
+
"language_model.model.layers.34.mlp.gate_proj": {
|
| 604 |
+
"group_size": 32,
|
| 605 |
+
"bits": 4,
|
| 606 |
+
"mode": "affine"
|
| 607 |
+
},
|
| 608 |
+
"language_model.model.layers.34.mlp.up_proj": {
|
| 609 |
+
"group_size": 32,
|
| 610 |
+
"bits": 4,
|
| 611 |
+
"mode": "affine"
|
| 612 |
+
},
|
| 613 |
+
"language_model.model.layers.35.mlp.down_proj": {
|
| 614 |
+
"group_size": 32,
|
| 615 |
+
"bits": 4,
|
| 616 |
+
"mode": "affine"
|
| 617 |
+
},
|
| 618 |
+
"language_model.model.layers.35.mlp.gate_proj": {
|
| 619 |
+
"group_size": 32,
|
| 620 |
+
"bits": 4,
|
| 621 |
+
"mode": "affine"
|
| 622 |
+
},
|
| 623 |
+
"language_model.model.layers.35.mlp.up_proj": {
|
| 624 |
+
"group_size": 32,
|
| 625 |
+
"bits": 4,
|
| 626 |
+
"mode": "affine"
|
| 627 |
+
},
|
| 628 |
+
"language_model.model.layers.36.mlp.down_proj": {
|
| 629 |
+
"group_size": 32,
|
| 630 |
+
"bits": 4,
|
| 631 |
+
"mode": "affine"
|
| 632 |
+
},
|
| 633 |
+
"language_model.model.layers.36.mlp.gate_proj": {
|
| 634 |
+
"group_size": 32,
|
| 635 |
+
"bits": 4,
|
| 636 |
+
"mode": "affine"
|
| 637 |
+
},
|
| 638 |
+
"language_model.model.layers.36.mlp.up_proj": {
|
| 639 |
+
"group_size": 32,
|
| 640 |
+
"bits": 4,
|
| 641 |
+
"mode": "affine"
|
| 642 |
+
},
|
| 643 |
+
"language_model.model.layers.37.mlp.down_proj": {
|
| 644 |
+
"group_size": 32,
|
| 645 |
+
"bits": 4,
|
| 646 |
+
"mode": "affine"
|
| 647 |
+
},
|
| 648 |
+
"language_model.model.layers.37.mlp.gate_proj": {
|
| 649 |
+
"group_size": 32,
|
| 650 |
+
"bits": 4,
|
| 651 |
+
"mode": "affine"
|
| 652 |
+
},
|
| 653 |
+
"language_model.model.layers.37.mlp.up_proj": {
|
| 654 |
+
"group_size": 32,
|
| 655 |
+
"bits": 4,
|
| 656 |
+
"mode": "affine"
|
| 657 |
+
},
|
| 658 |
+
"language_model.model.layers.38.mlp.down_proj": {
|
| 659 |
+
"group_size": 32,
|
| 660 |
+
"bits": 4,
|
| 661 |
+
"mode": "affine"
|
| 662 |
+
},
|
| 663 |
+
"language_model.model.layers.38.mlp.gate_proj": {
|
| 664 |
+
"group_size": 32,
|
| 665 |
+
"bits": 4,
|
| 666 |
+
"mode": "affine"
|
| 667 |
+
},
|
| 668 |
+
"language_model.model.layers.38.mlp.up_proj": {
|
| 669 |
+
"group_size": 32,
|
| 670 |
+
"bits": 4,
|
| 671 |
+
"mode": "affine"
|
| 672 |
+
},
|
| 673 |
+
"language_model.model.layers.39.mlp.down_proj": {
|
| 674 |
+
"group_size": 32,
|
| 675 |
+
"bits": 4,
|
| 676 |
+
"mode": "affine"
|
| 677 |
+
},
|
| 678 |
+
"language_model.model.layers.39.mlp.gate_proj": {
|
| 679 |
+
"group_size": 32,
|
| 680 |
+
"bits": 4,
|
| 681 |
+
"mode": "affine"
|
| 682 |
+
},
|
| 683 |
+
"language_model.model.layers.39.mlp.up_proj": {
|
| 684 |
+
"group_size": 32,
|
| 685 |
+
"bits": 4,
|
| 686 |
+
"mode": "affine"
|
| 687 |
+
},
|
| 688 |
+
"language_model.model.layers.4.mlp.down_proj": {
|
| 689 |
+
"group_size": 32,
|
| 690 |
+
"bits": 4,
|
| 691 |
+
"mode": "affine"
|
| 692 |
+
},
|
| 693 |
+
"language_model.model.layers.4.mlp.gate_proj": {
|
| 694 |
+
"group_size": 32,
|
| 695 |
+
"bits": 4,
|
| 696 |
+
"mode": "affine"
|
| 697 |
+
},
|
| 698 |
+
"language_model.model.layers.4.mlp.up_proj": {
|
| 699 |
+
"group_size": 32,
|
| 700 |
+
"bits": 4,
|
| 701 |
+
"mode": "affine"
|
| 702 |
+
},
|
| 703 |
+
"language_model.model.layers.40.mlp.down_proj": {
|
| 704 |
+
"group_size": 32,
|
| 705 |
+
"bits": 4,
|
| 706 |
+
"mode": "affine"
|
| 707 |
+
},
|
| 708 |
+
"language_model.model.layers.40.mlp.gate_proj": {
|
| 709 |
+
"group_size": 32,
|
| 710 |
+
"bits": 4,
|
| 711 |
+
"mode": "affine"
|
| 712 |
+
},
|
| 713 |
+
"language_model.model.layers.40.mlp.up_proj": {
|
| 714 |
+
"group_size": 32,
|
| 715 |
+
"bits": 4,
|
| 716 |
+
"mode": "affine"
|
| 717 |
+
},
|
| 718 |
+
"language_model.model.layers.41.mlp.down_proj": {
|
| 719 |
+
"group_size": 32,
|
| 720 |
+
"bits": 4,
|
| 721 |
+
"mode": "affine"
|
| 722 |
+
},
|
| 723 |
+
"language_model.model.layers.41.mlp.gate_proj": {
|
| 724 |
+
"group_size": 32,
|
| 725 |
+
"bits": 4,
|
| 726 |
+
"mode": "affine"
|
| 727 |
+
},
|
| 728 |
+
"language_model.model.layers.41.mlp.up_proj": {
|
| 729 |
+
"group_size": 32,
|
| 730 |
+
"bits": 4,
|
| 731 |
+
"mode": "affine"
|
| 732 |
+
},
|
| 733 |
+
"language_model.model.layers.42.mlp.down_proj": {
|
| 734 |
+
"group_size": 32,
|
| 735 |
+
"bits": 4,
|
| 736 |
+
"mode": "affine"
|
| 737 |
+
},
|
| 738 |
+
"language_model.model.layers.42.mlp.gate_proj": {
|
| 739 |
+
"group_size": 32,
|
| 740 |
+
"bits": 4,
|
| 741 |
+
"mode": "affine"
|
| 742 |
+
},
|
| 743 |
+
"language_model.model.layers.42.mlp.up_proj": {
|
| 744 |
+
"group_size": 32,
|
| 745 |
+
"bits": 4,
|
| 746 |
+
"mode": "affine"
|
| 747 |
+
},
|
| 748 |
+
"language_model.model.layers.43.mlp.down_proj": {
|
| 749 |
+
"group_size": 32,
|
| 750 |
+
"bits": 4,
|
| 751 |
+
"mode": "affine"
|
| 752 |
+
},
|
| 753 |
+
"language_model.model.layers.43.mlp.gate_proj": {
|
| 754 |
+
"group_size": 32,
|
| 755 |
+
"bits": 4,
|
| 756 |
+
"mode": "affine"
|
| 757 |
+
},
|
| 758 |
+
"language_model.model.layers.43.mlp.up_proj": {
|
| 759 |
+
"group_size": 32,
|
| 760 |
+
"bits": 4,
|
| 761 |
+
"mode": "affine"
|
| 762 |
+
},
|
| 763 |
+
"language_model.model.layers.44.mlp.down_proj": {
|
| 764 |
+
"group_size": 32,
|
| 765 |
+
"bits": 4,
|
| 766 |
+
"mode": "affine"
|
| 767 |
+
},
|
| 768 |
+
"language_model.model.layers.44.mlp.gate_proj": {
|
| 769 |
+
"group_size": 32,
|
| 770 |
+
"bits": 4,
|
| 771 |
+
"mode": "affine"
|
| 772 |
+
},
|
| 773 |
+
"language_model.model.layers.44.mlp.up_proj": {
|
| 774 |
+
"group_size": 32,
|
| 775 |
+
"bits": 4,
|
| 776 |
+
"mode": "affine"
|
| 777 |
+
},
|
| 778 |
+
"language_model.model.layers.45.mlp.down_proj": {
|
| 779 |
+
"group_size": 32,
|
| 780 |
+
"bits": 4,
|
| 781 |
+
"mode": "affine"
|
| 782 |
+
},
|
| 783 |
+
"language_model.model.layers.45.mlp.gate_proj": {
|
| 784 |
+
"group_size": 32,
|
| 785 |
+
"bits": 4,
|
| 786 |
+
"mode": "affine"
|
| 787 |
+
},
|
| 788 |
+
"language_model.model.layers.45.mlp.up_proj": {
|
| 789 |
+
"group_size": 32,
|
| 790 |
+
"bits": 4,
|
| 791 |
+
"mode": "affine"
|
| 792 |
+
},
|
| 793 |
+
"language_model.model.layers.46.mlp.down_proj": {
|
| 794 |
+
"group_size": 32,
|
| 795 |
+
"bits": 4,
|
| 796 |
+
"mode": "affine"
|
| 797 |
+
},
|
| 798 |
+
"language_model.model.layers.46.mlp.gate_proj": {
|
| 799 |
+
"group_size": 32,
|
| 800 |
+
"bits": 4,
|
| 801 |
+
"mode": "affine"
|
| 802 |
+
},
|
| 803 |
+
"language_model.model.layers.46.mlp.up_proj": {
|
| 804 |
+
"group_size": 32,
|
| 805 |
+
"bits": 4,
|
| 806 |
+
"mode": "affine"
|
| 807 |
+
},
|
| 808 |
+
"language_model.model.layers.47.mlp.down_proj": {
|
| 809 |
+
"group_size": 32,
|
| 810 |
+
"bits": 4,
|
| 811 |
+
"mode": "affine"
|
| 812 |
+
},
|
| 813 |
+
"language_model.model.layers.47.mlp.gate_proj": {
|
| 814 |
+
"group_size": 32,
|
| 815 |
+
"bits": 4,
|
| 816 |
+
"mode": "affine"
|
| 817 |
+
},
|
| 818 |
+
"language_model.model.layers.47.mlp.up_proj": {
|
| 819 |
+
"group_size": 32,
|
| 820 |
+
"bits": 4,
|
| 821 |
+
"mode": "affine"
|
| 822 |
+
},
|
| 823 |
+
"language_model.model.layers.5.mlp.down_proj": {
|
| 824 |
+
"group_size": 32,
|
| 825 |
+
"bits": 4,
|
| 826 |
+
"mode": "affine"
|
| 827 |
+
},
|
| 828 |
+
"language_model.model.layers.5.mlp.gate_proj": {
|
| 829 |
+
"group_size": 32,
|
| 830 |
+
"bits": 4,
|
| 831 |
+
"mode": "affine"
|
| 832 |
+
},
|
| 833 |
+
"language_model.model.layers.5.mlp.up_proj": {
|
| 834 |
+
"group_size": 32,
|
| 835 |
+
"bits": 4,
|
| 836 |
+
"mode": "affine"
|
| 837 |
+
},
|
| 838 |
+
"language_model.model.layers.6.mlp.down_proj": {
|
| 839 |
+
"group_size": 32,
|
| 840 |
+
"bits": 4,
|
| 841 |
+
"mode": "affine"
|
| 842 |
+
},
|
| 843 |
+
"language_model.model.layers.6.mlp.gate_proj": {
|
| 844 |
+
"group_size": 32,
|
| 845 |
+
"bits": 4,
|
| 846 |
+
"mode": "affine"
|
| 847 |
+
},
|
| 848 |
+
"language_model.model.layers.6.mlp.up_proj": {
|
| 849 |
+
"group_size": 32,
|
| 850 |
+
"bits": 4,
|
| 851 |
+
"mode": "affine"
|
| 852 |
+
},
|
| 853 |
+
"language_model.model.layers.7.mlp.down_proj": {
|
| 854 |
+
"group_size": 32,
|
| 855 |
+
"bits": 4,
|
| 856 |
+
"mode": "affine"
|
| 857 |
+
},
|
| 858 |
+
"language_model.model.layers.7.mlp.gate_proj": {
|
| 859 |
+
"group_size": 32,
|
| 860 |
+
"bits": 4,
|
| 861 |
+
"mode": "affine"
|
| 862 |
+
},
|
| 863 |
+
"language_model.model.layers.7.mlp.up_proj": {
|
| 864 |
+
"group_size": 32,
|
| 865 |
+
"bits": 4,
|
| 866 |
+
"mode": "affine"
|
| 867 |
+
},
|
| 868 |
+
"language_model.model.layers.8.mlp.down_proj": {
|
| 869 |
+
"group_size": 32,
|
| 870 |
+
"bits": 4,
|
| 871 |
+
"mode": "affine"
|
| 872 |
+
},
|
| 873 |
+
"language_model.model.layers.8.mlp.gate_proj": {
|
| 874 |
+
"group_size": 32,
|
| 875 |
+
"bits": 4,
|
| 876 |
+
"mode": "affine"
|
| 877 |
+
},
|
| 878 |
+
"language_model.model.layers.8.mlp.up_proj": {
|
| 879 |
+
"group_size": 32,
|
| 880 |
+
"bits": 4,
|
| 881 |
+
"mode": "affine"
|
| 882 |
+
},
|
| 883 |
+
"language_model.model.layers.9.mlp.down_proj": {
|
| 884 |
+
"group_size": 32,
|
| 885 |
+
"bits": 4,
|
| 886 |
+
"mode": "affine"
|
| 887 |
+
},
|
| 888 |
+
"language_model.model.layers.9.mlp.gate_proj": {
|
| 889 |
+
"group_size": 32,
|
| 890 |
+
"bits": 4,
|
| 891 |
+
"mode": "affine"
|
| 892 |
+
},
|
| 893 |
+
"language_model.model.layers.9.mlp.up_proj": {
|
| 894 |
+
"group_size": 32,
|
| 895 |
+
"bits": 4,
|
| 896 |
+
"mode": "affine"
|
| 897 |
+
}
|
| 898 |
+
},
|
| 899 |
+
"capabilities": {
|
| 900 |
+
"reasoning_parser": "gemma4",
|
| 901 |
+
"tool_parser": "gemma4",
|
| 902 |
+
"think_in_template": false,
|
| 903 |
+
"supports_tools": true,
|
| 904 |
+
"supports_thinking": true,
|
| 905 |
+
"family": "gemma4",
|
| 906 |
+
"modality": "vision",
|
| 907 |
+
"cache_type": "kv"
|
| 908 |
+
}
|
| 909 |
+
}
|
dealign_logo.png
ADDED
|
dealign_mascot.png
ADDED
|
generation_config.json
ADDED
|
@@ -0,0 +1,18 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"bos_token_id": 2,
|
| 3 |
+
"do_sample": true,
|
| 4 |
+
"eos_token_id": [
|
| 5 |
+
1,
|
| 6 |
+
106,
|
| 7 |
+
50
|
| 8 |
+
],
|
| 9 |
+
"pad_token_id": 0,
|
| 10 |
+
"suppress_tokens": [
|
| 11 |
+
258883,
|
| 12 |
+
258882
|
| 13 |
+
],
|
| 14 |
+
"temperature": 1.0,
|
| 15 |
+
"top_k": 64,
|
| 16 |
+
"top_p": 0.95,
|
| 17 |
+
"transformers_version": "5.10.0.dev0"
|
| 18 |
+
}
|
jang_config.json
ADDED
|
@@ -0,0 +1,55 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"version": 2,
|
| 3 |
+
"weight_format": "jang_affine",
|
| 4 |
+
"profile": "JANG_4M",
|
| 5 |
+
"source_model": {
|
| 6 |
+
"name": "gemma-4-12B-it",
|
| 7 |
+
"architecture": "gemma4_unified_text"
|
| 8 |
+
},
|
| 9 |
+
"has_vision": true,
|
| 10 |
+
"has_audio": true,
|
| 11 |
+
"quantization": {
|
| 12 |
+
"method": "jang_affine",
|
| 13 |
+
"quantization_backend": "mx.quantize",
|
| 14 |
+
"mode": "affine",
|
| 15 |
+
"group_size": 32,
|
| 16 |
+
"tier_bits": {
|
| 17 |
+
"attention": 8,
|
| 18 |
+
"mlp": 4,
|
| 19 |
+
"embed": 16
|
| 20 |
+
},
|
| 21 |
+
"tied_embedding": "fp16_passthrough",
|
| 22 |
+
"norm_convention": "gemma4_scale_shift_zero",
|
| 23 |
+
"multimodal": "fp16_passthrough_embedders_early_fusion",
|
| 24 |
+
"mtp_policy": "none",
|
| 25 |
+
"per_module_override_count": 144,
|
| 26 |
+
"passthrough_tensor_count": 349
|
| 27 |
+
},
|
| 28 |
+
"runtime": {
|
| 29 |
+
"total_weight_bytes": 10135442880,
|
| 30 |
+
"total_weight_gb": 9.44,
|
| 31 |
+
"attention": "hybrid_swa_full_5to1",
|
| 32 |
+
"sliding_window": 1024,
|
| 33 |
+
"attention_k_eq_v_on_full_layers": true,
|
| 34 |
+
"full_attention_layers": [
|
| 35 |
+
5,
|
| 36 |
+
11,
|
| 37 |
+
17,
|
| 38 |
+
23,
|
| 39 |
+
29,
|
| 40 |
+
35,
|
| 41 |
+
41,
|
| 42 |
+
47
|
| 43 |
+
]
|
| 44 |
+
},
|
| 45 |
+
"capabilities": {
|
| 46 |
+
"reasoning_parser": "gemma4",
|
| 47 |
+
"tool_parser": "gemma4",
|
| 48 |
+
"think_in_template": false,
|
| 49 |
+
"supports_tools": true,
|
| 50 |
+
"supports_thinking": true,
|
| 51 |
+
"family": "gemma4",
|
| 52 |
+
"modality": "vision",
|
| 53 |
+
"cache_type": "kv"
|
| 54 |
+
}
|
| 55 |
+
}
|
model-00001-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:951a2a2c026b9810575744a815cdf33dde57a1547ecff8e46d467745fa286555
|
| 3 |
+
size 2047672664
|
model-00002-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:437a4827c3de3f50ba5017985d48b629ac61e65871ebe933c702347facc9e1fd
|
| 3 |
+
size 1001192692
|
model-00003-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:67277789451bf8b1699707a3073d62e8c63977797eabd2c9cedc6c1d946ff94d
|
| 3 |
+
size 1002176044
|
model-00004-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d727ecffda370546fece49d0b6a327e66f941423014c16dc6a289d3e8d5f2d80
|
| 3 |
+
size 1010040320
|
model-00005-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8158819add511610ccfc4631c625f80f63fa9445bb0b3193ebb9182c6341b681
|
| 3 |
+
size 1013972490
|
model-00006-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:78912bfd14bfd2eb3dbc0f5717222aa63fe7ec51ff7872aea57c1e49c88f276e
|
| 3 |
+
size 1014955699
|
model-00007-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:28700ff2796fa827ac8bbf2ec51cce76d98c9ee60c4179353ae6b690e652c04a
|
| 3 |
+
size 1013734310
|
model-00008-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f1bbbe31ed9fd14f27717a8214598e2baa235e456fe3228bcc2fee7aa5bf2674
|
| 3 |
+
size 1002176052
|
model-00009-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7b3eb166f2824e2fba6b8f830078b9103526d230e854f01c3ee8e62f406db43c
|
| 3 |
+
size 1045683690
|
model-00010-of-00010.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cda6679df16fa313565c96cd1528a56a989a292a4436b141d464e3a518a15e99
|
| 3 |
+
size 17262256
|
model.safetensors.index.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
processor_config.json
ADDED
|
@@ -0,0 +1,62 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"audio_ms_per_token": 40,
|
| 3 |
+
"audio_seq_length": 750,
|
| 4 |
+
"feature_extractor": {
|
| 5 |
+
"audio_samples_per_token": 640,
|
| 6 |
+
"feature_extractor_type": "Gemma4UnifiedAudioFeatureExtractor",
|
| 7 |
+
"feature_size": 640,
|
| 8 |
+
"padding_side": "right",
|
| 9 |
+
"padding_value": 0.0,
|
| 10 |
+
"return_attention_mask": true,
|
| 11 |
+
"sampling_rate": 16000
|
| 12 |
+
},
|
| 13 |
+
"image_processor": {
|
| 14 |
+
"do_convert_rgb": true,
|
| 15 |
+
"do_normalize": false,
|
| 16 |
+
"do_rescale": true,
|
| 17 |
+
"do_resize": true,
|
| 18 |
+
"image_mean": [
|
| 19 |
+
0.0,
|
| 20 |
+
0.0,
|
| 21 |
+
0.0
|
| 22 |
+
],
|
| 23 |
+
"image_processor_type": "Gemma4UnifiedImageProcessor",
|
| 24 |
+
"image_std": [
|
| 25 |
+
1.0,
|
| 26 |
+
1.0,
|
| 27 |
+
1.0
|
| 28 |
+
],
|
| 29 |
+
"max_soft_tokens": 280,
|
| 30 |
+
"patch_size": 16,
|
| 31 |
+
"pooling_kernel_size": 3,
|
| 32 |
+
"resample": 3,
|
| 33 |
+
"rescale_factor": 0.00392156862745098
|
| 34 |
+
},
|
| 35 |
+
"image_seq_length": 280,
|
| 36 |
+
"processor_class": "Gemma4UnifiedProcessor",
|
| 37 |
+
"video_processor": {
|
| 38 |
+
"do_convert_rgb": true,
|
| 39 |
+
"do_normalize": true,
|
| 40 |
+
"do_rescale": true,
|
| 41 |
+
"do_resize": true,
|
| 42 |
+
"do_sample_frames": true,
|
| 43 |
+
"image_mean": [
|
| 44 |
+
0.0,
|
| 45 |
+
0.0,
|
| 46 |
+
0.0
|
| 47 |
+
],
|
| 48 |
+
"image_std": [
|
| 49 |
+
1.0,
|
| 50 |
+
1.0,
|
| 51 |
+
1.0
|
| 52 |
+
],
|
| 53 |
+
"max_soft_tokens": 70,
|
| 54 |
+
"num_frames": 32,
|
| 55 |
+
"patch_size": 16,
|
| 56 |
+
"pooling_kernel_size": 3,
|
| 57 |
+
"resample": 3,
|
| 58 |
+
"rescale_factor": 0.00392156862745098,
|
| 59 |
+
"return_metadata": false,
|
| 60 |
+
"video_processor_type": "Gemma4UnifiedVideoProcessor"
|
| 61 |
+
}
|
| 62 |
+
}
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:cc8d3a0ce36466ccc1278bf987df5f71db1719b9ca6b4118264f45cb627bfe0f
|
| 3 |
+
size 32169626
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,75 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"audio_token": "<|audio|>",
|
| 3 |
+
"backend": "tokenizers",
|
| 4 |
+
"boa_token": "<|audio>",
|
| 5 |
+
"boi_token": "<|image>",
|
| 6 |
+
"bos_token": "<bos>",
|
| 7 |
+
"eoa_token": "<audio|>",
|
| 8 |
+
"eoc_token": "<channel|>",
|
| 9 |
+
"eoi_token": "<image|>",
|
| 10 |
+
"eos_token": "<eos>",
|
| 11 |
+
"eot_token": "<turn|>",
|
| 12 |
+
"escape_token": "<|\"|>",
|
| 13 |
+
"etc_token": "<tool_call|>",
|
| 14 |
+
"etd_token": "<tool|>",
|
| 15 |
+
"etr_token": "<tool_response|>",
|
| 16 |
+
"extra_special_tokens": [
|
| 17 |
+
"<|video|>"
|
| 18 |
+
],
|
| 19 |
+
"image_token": "<|image|>",
|
| 20 |
+
"mask_token": "<mask>",
|
| 21 |
+
"model_max_length": 1000000000000000019884624838656,
|
| 22 |
+
"pad_token": "<pad>",
|
| 23 |
+
"padding_side": "left",
|
| 24 |
+
"processor_class": "Gemma4UnifiedProcessor",
|
| 25 |
+
"response_schema": {
|
| 26 |
+
"properties": {
|
| 27 |
+
"content": {
|
| 28 |
+
"type": "string"
|
| 29 |
+
},
|
| 30 |
+
"role": {
|
| 31 |
+
"const": "assistant"
|
| 32 |
+
},
|
| 33 |
+
"thinking": {
|
| 34 |
+
"type": "string"
|
| 35 |
+
},
|
| 36 |
+
"tool_calls": {
|
| 37 |
+
"items": {
|
| 38 |
+
"properties": {
|
| 39 |
+
"function": {
|
| 40 |
+
"properties": {
|
| 41 |
+
"arguments": {
|
| 42 |
+
"additionalProperties": {},
|
| 43 |
+
"type": "object",
|
| 44 |
+
"x-parser": "gemma4-tool-call"
|
| 45 |
+
},
|
| 46 |
+
"name": {
|
| 47 |
+
"type": "string"
|
| 48 |
+
}
|
| 49 |
+
},
|
| 50 |
+
"type": "object",
|
| 51 |
+
"x-regex": "call\\:(?P<name>\\w+)(?P<arguments>\\{.*\\})"
|
| 52 |
+
},
|
| 53 |
+
"type": {
|
| 54 |
+
"const": "function"
|
| 55 |
+
}
|
| 56 |
+
},
|
| 57 |
+
"type": "object"
|
| 58 |
+
},
|
| 59 |
+
"type": "array",
|
| 60 |
+
"x-regex-iterator": "<\\|tool_call>(.*?)<tool_call\\|>"
|
| 61 |
+
}
|
| 62 |
+
},
|
| 63 |
+
"type": "object",
|
| 64 |
+
"x-regex": "(\\<\\|channel\\>thought\\n(?P<thinking>.*?)\\<channel\\|\\>)?(?P<tool_calls>\\<\\|tool_call\\>.*\\<tool_call\\|\\>)?(?P<content>(?:(?!\\<turn\\|\\>)(?!\\<\\|tool_response\\>).)+)?(?:\\<turn\\|\\>|\\<\\|tool_response\\>)?"
|
| 65 |
+
},
|
| 66 |
+
"soc_token": "<|channel>",
|
| 67 |
+
"sot_token": "<|turn>",
|
| 68 |
+
"stc_token": "<|tool_call>",
|
| 69 |
+
"std_token": "<|tool>",
|
| 70 |
+
"str_token": "<|tool_response>",
|
| 71 |
+
"think_token": "<|think|>",
|
| 72 |
+
"tokenizer_class": "GemmaTokenizer",
|
| 73 |
+
"unk_token": "<unk>",
|
| 74 |
+
"chat_template": "{%- macro format_parameters(properties, required, filter_keys=false) -%}\n {%- set standard_keys = ['description', 'type', 'properties', 'required', 'nullable'] -%}\n {%- set ns = namespace(found_first=false) -%}\n {%- for key, value in properties | dictsort -%}\n {%- set add_comma = false -%}\n {%- if not filter_keys or key not in standard_keys -%}\n {%- if ns.found_first %},{% endif -%}\n {%- set ns.found_first = true -%}\n {{ key }}:{\n {%- if value['description'] -%}\n description:<|\"|>{{ value['description'] }}<|\"|>\n {%- set add_comma = true -%}\n {%- endif -%}\n {%- if value['type'] | upper == 'STRING' -%}\n {%- if value['enum'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n enum:{{ format_argument(value['enum']) }}\n {%- endif -%}\n {%- elif value['type'] | upper == 'ARRAY' -%}\n {%- if value['items'] is mapping and value['items'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n items:{\n {%- set ns_items = namespace(found_first=false) -%}\n {%- for item_key, item_value in value['items'] | dictsort -%}\n {%- if item_value is not none -%}\n {%- if ns_items.found_first %},{% endif -%}\n {%- set ns_items.found_first = true -%}\n {%- if item_key == 'properties' -%}\n properties:{\n {%- if item_value is mapping -%}\n {{- format_parameters(item_value, value['items']['required'] | default([])) -}}\n {%- endif -%}\n }\n {%- elif item_key == 'required' -%}\n required:[\n {%- for req_item in item_value -%}\n <|\"|>{{- req_item -}}<|\"|>\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n ]\n {%- elif item_key == 'type' -%}\n {%- if item_value is string -%}\n type:{{ format_argument(item_value | upper) }}\n {%- else -%}\n type:{{ format_argument(item_value | map('upper') | list) }}\n {%- endif -%}\n {%- else -%}\n {{ item_key }}:{{ format_argument(item_value) }}\n {%- endif -%}\n {%- endif -%}\n {%- endfor -%}\n }\n {%- endif -%}\n {%- endif -%}\n {%- if value['nullable'] %}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n nullable:true\n {%- endif -%}\n {%- if value['type'] | upper == 'OBJECT' -%}\n {%- if value['properties'] is defined and value['properties'] is mapping -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n properties:{\n {{- format_parameters(value['properties'], value['required'] | default([])) -}}\n }\n {%- elif value is mapping -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n properties:{\n {{- format_parameters(value, value['required'] | default([]), filter_keys=true) -}}\n }\n {%- endif -%}\n {%- if value['required'] -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n required:[\n {%- for item in value['required'] | default([]) -%}\n <|\"|>{{- item -}}<|\"|>\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n ]\n {%- endif -%}\n {%- endif -%}\n {%- if add_comma %},{%- else -%} {%- set add_comma = true -%} {% endif -%}\n type:<|\"|>{{ value['type'] | upper }}<|\"|>}\n {%- endif -%}\n {%- endfor -%}\n{%- endmacro -%}\n{%- macro format_function_declaration(tool_data) -%}\n declaration:{{- tool_data['function']['name'] -}}{description:<|\"|>{{- tool_data['function']['description'] -}}<|\"|>\n {%- set params = tool_data['function']['parameters'] -%}\n {%- if params -%}\n ,parameters:{\n {%- if params['properties'] -%}\n properties:{ {{- format_parameters(params['properties'], params['required']) -}} },\n {%- endif -%}\n {%- if params['required'] -%}\n required:[\n {%- for item in params['required'] -%}\n <|\"|>{{- item -}}<|\"|>\n {{- ',' if not loop.last -}}\n {%- endfor -%}\n ],\n {%- endif -%}\n {%- if params['type'] -%}\n type:<|\"|>{{- params['type'] | upper -}}<|\"|>}\n {%- endif -%}\n {%- endif -%}\n {%- if 'response' in tool_data['function'] -%}\n {%- set response_declaration = tool_data['function']['response'] -%}\n ,response:{\n {%- if response_declaration['description'] -%}\n description:<|\"|>{{- response_declaration['description'] -}}<|\"|>,\n {%- endif -%}\n {%- if response_declaration['type'] | upper == 'OBJECT' -%}\n type:<|\"|>{{- response_declaration['type'] | upper -}}<|\"|>}\n {%- endif -%}\n {%- endif -%}\n }\n{%- endmacro -%}\n{%- macro format_argument(argument, escape_keys=True) -%}\n {%- if argument is string -%}\n {{- '<|\"|>' + argument + '<|\"|>' -}}\n {%- elif argument is boolean -%}\n {{- 'true' if argument else 'false' -}}\n {%- elif argument is mapping -%}\n {{- '{' -}}\n {%- set ns = namespace(found_first=false) -%}\n {%- for key, value in argument | dictsort -%}\n {%- if ns.found_first %},{% endif -%}\n {%- set ns.found_first = true -%}\n {%- if escape_keys -%}\n {{- '<|\"|>' + key + '<|\"|>' -}}\n {%- else -%}\n {{- key -}}\n {%- endif -%}\n :{{- format_argument(value, escape_keys=escape_keys) -}}\n {%- endfor -%}\n {{- '}' -}}\n {%- elif argument is sequence -%}\n {{- '[' -}}\n {%- for item in argument -%}\n {{- format_argument(item, escape_keys=escape_keys) -}}\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n {{- ']' -}}\n {%- else -%}\n {{- argument -}}\n {%- endif -%}\n{%- endmacro -%}\n{%- macro strip_thinking(text) -%}\n {%- set ns = namespace(result='') -%}\n {%- for part in text.split('<channel|>') -%}\n {%- if '<|channel>' in part -%}\n {%- set ns.result = ns.result + part.split('<|channel>')[0] -%}\n {%- else -%}\n {%- set ns.result = ns.result + part -%}\n {%- endif -%}\n {%- endfor -%}\n {{- ns.result | trim -}}\n{%- endmacro -%}\n\n{%- macro format_tool_response_block(tool_name, response) -%}\n {{- '<|tool_response>' -}}\n {%- if response is mapping -%}\n {{- 'response:' + tool_name + '{' -}}\n {%- for key, value in response | dictsort -%}\n {{- key -}}:{{- format_argument(value, escape_keys=False) -}}\n {%- if not loop.last %},{% endif -%}\n {%- endfor -%}\n {{- '}' -}}\n {%- else -%}\n {{- 'response:' + tool_name + '{value:' + format_argument(response, escape_keys=False) + '}' -}}\n {%- endif -%}\n {{- '<tool_response|>' -}}\n{%- endmacro -%}\n\n{%- set ns = namespace(prev_message_type=None) -%}\n{%- set loop_messages = messages -%}\n{{- bos_token -}}\n{#- Handle System/Tool Definitions Block -#}\n{%- if (enable_thinking is defined and enable_thinking) or tools or messages[0]['role'] in ['system', 'developer'] -%}\n {{- '<|turn>system\\n' -}}\n {#- Inject Thinking token at the very top of the FIRST system turn -#}\n {%- if enable_thinking is defined and enable_thinking -%}\n {{- '<|think|>\\n' -}}\n {%- set ns.prev_message_type = 'think' -%}\n {%- endif -%}\n {%- if messages[0]['role'] in ['system', 'developer'] -%}\n {%- if messages[0]['content'] is string -%}\n {{- messages[0]['content'] | trim -}}\n {%- elif messages[0]['content'] is sequence -%}\n {%- for item in messages[0]['content'] -%}\n {{- item['text'] | trim + ' '-}}\n {%- endfor -%}\n {%- endif -%}\n {%- set loop_messages = messages[1:] -%}\n {%- endif -%}\n {%- if tools -%}\n {%- for tool in tools %}\n {{- '<|tool>' -}}\n {{- format_function_declaration(tool) | trim -}}\n {{- '<tool|>' -}}\n {%- endfor %}\n {%- set ns.prev_message_type = 'tool' -%}\n {%- if tool_choice is defined and ((tool_choice is string and tool_choice == 'required') or (tool_choice is mapping and tool_choice.get('type') == 'required')) -%}\n {{- '\\nTool use is REQUIRED for this turn: you must call exactly one of the declared tools. Output only the tool call, and copy every argument value verbatim \\u2014 character for character, preserving punctuation and newlines, with no added or removed whitespace.' -}}\n {%- endif -%}\n {%- endif -%}\n {{- '<turn|>\\n' -}}\n{%- endif %}\n\n{#- Pre-scan: find last user message index for reasoning guard -#}\n{%- set ns_turn = namespace(last_user_idx=-1) -%}\n{%- for i in range(loop_messages | length) -%}\n {%- if loop_messages[i]['role'] == 'user' -%}\n {%- set ns_turn.last_user_idx = i -%}\n {%- endif -%}\n{%- endfor -%}\n\n{#- Loop through messages -#}\n{%- for message in loop_messages -%}\n {%- if message['role'] != 'tool' -%}\n {%- set ns.prev_message_type = None -%}\n {%- set role = 'model' if message['role'] == 'assistant' else message['role'] -%}\n {#- Detect continuation: suppress duplicate <|turn>model when previous non-tool message was also assistant -#}\n {%- set prev_nt = namespace(role=None, found=false) -%}\n {%- if loop.index0 > 0 -%}\n {%- for j in range(loop.index0 - 1, -1, -1) -%}\n {%- if not prev_nt.found -%}\n {%- if loop_messages[j]['role'] != 'tool' -%}\n {%- set prev_nt.role = loop_messages[j]['role'] -%}\n {%- set prev_nt.found = true -%}\n {%- endif -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n {%- set continue_same_model_turn = (role == 'model' and prev_nt.role == 'assistant') -%}\n {%- if not continue_same_model_turn -%}\n {{- '<|turn>' + role + '\\n' }}\n {%- endif -%}\n\n {#- Render reasoning/reasoning_content as thinking channel -#}\n {%- set thinking_text = message.get('reasoning') or message.get('reasoning_content') -%}\n {%- if thinking_text and loop.index0 > ns_turn.last_user_idx and message.get('tool_calls') -%}\n {{- '<|channel>thought\\n' + thinking_text + '\\n<channel|>' -}}\n {%- endif -%}\n\n {%- if message['tool_calls'] -%}\n {%- for tool_call in message['tool_calls'] -%}\n {%- set function = tool_call['function'] -%}\n {{- '<|tool_call>call:' + function['name'] + '{' -}}\n {%- if function['arguments'] is mapping -%}\n {%- set ns_args = namespace(found_first=false) -%}\n {%- for key, value in function['arguments'] | dictsort -%}\n {%- if ns_args.found_first %},{% endif -%}\n {%- set ns_args.found_first = true -%}\n {{- key -}}:{{- format_argument(value, escape_keys=False) -}}\n {%- endfor -%}\n {%- elif function['arguments'] is string -%}\n {{- function['arguments'] -}}\n {%- endif -%}\n {{- '}<tool_call|>' -}}\n {%- endfor -%}\n {%- set ns.prev_message_type = 'tool_call' -%}\n {%- endif -%}\n\n {%- set ns_tr_out = namespace(flag=false) -%}\n {%- if message.get('tool_responses') -%}\n {#- Legacy: tool_responses embedded on the assistant message (Google/Gemma native) -#}\n {%- for tool_response in message['tool_responses'] -%}\n {{- format_tool_response_block(tool_response['name'] | default('unknown'), tool_response['response']) -}}\n {%- set ns_tr_out.flag = true -%}\n {%- set ns.prev_message_type = 'tool_response' -%}\n {%- endfor -%}\n {%- elif message.get('tool_calls') -%}\n {#- OpenAI Chat Completions: forward-scan consecutive role:tool messages -#}\n {%- set ns_tool_scan = namespace(stopped=false) -%}\n {%- for k in range(loop.index0 + 1, loop_messages | length) -%}\n {%- if ns_tool_scan.stopped -%}\n {%- elif loop_messages[k]['role'] != 'tool' -%}\n {%- set ns_tool_scan.stopped = true -%}\n {%- else -%}\n {%- set follow = loop_messages[k] -%}\n {#- Resolve tool_call_id to function name -#}\n {%- set ns_tname = namespace(name=follow.get('name') | default('unknown')) -%}\n {%- for tc in message['tool_calls'] -%}\n {%- if tc.get('id') == follow.get('tool_call_id') -%}\n {%- set ns_tname.name = tc['function']['name'] -%}\n {%- endif -%}\n {%- endfor -%}\n {#- Handle content as string or content-parts array -#}\n {%- set tool_body = follow.get('content') -%}\n {%- if tool_body is string -%}\n {{- format_tool_response_block(ns_tname.name, tool_body) -}}\n {%- elif tool_body is sequence and tool_body is not string -%}\n {%- set ns_txt = namespace(s='') -%}\n {%- for part in tool_body -%}\n {%- if part.get('type') == 'text' -%}\n {%- set ns_txt.s = ns_txt.s + (part.get('text') | default('')) -%}\n {%- endif -%}\n {%- endfor -%}\n {{- format_tool_response_block(ns_tname.name, ns_txt.s) -}}\n {%- for part in tool_body -%}\n {%- if part.get('type') == 'image' -%}\n {{- '<|image|>' -}}\n {%- elif part.get('type') == 'audio' -%}\n {{- '<|audio|>' -}}\n {%- elif part.get('type') == 'video' -%}\n {{- '<|video|>' -}}\n {%- endif -%}\n {%- endfor -%}\n {%- else -%}\n {{- format_tool_response_block(ns_tname.name, tool_body) -}}\n {%- endif -%}\n {%- set ns_tr_out.flag = true -%}\n {%- set ns.prev_message_type = 'tool_response' -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n\n {%- set captured_content -%}\n {%- if message['content'] is string -%}\n {%- if role == 'model' -%}\n {{- strip_thinking(message['content']) -}}\n {%- else -%}\n {{- message['content'] | trim -}}\n {%- endif -%}\n {%- elif message['content'] is sequence -%}\n {%- for item in message['content'] -%}\n {%- if item['type'] == 'text' -%}\n {%- if role == 'model' -%}\n {{- strip_thinking(item['text']) -}}\n {%- else -%}\n {{- item['text'] | trim -}}\n {%- endif -%}\n {%- elif item['type'] == 'image' -%}\n {{- '<|image|>' -}}\n {%- set ns.prev_message_type = 'image' -%}\n {%- elif item['type'] == 'audio' -%}\n {{- '<|audio|>' -}}\n {%- set ns.prev_message_type = 'audio' -%}\n {%- elif item['type'] == 'video' -%}\n {{- '<|video|>' -}}\n {%- set ns.prev_message_type = 'video' -%}\n {%- endif -%}\n {%- endfor -%}\n {%- endif -%}\n {%- endset -%}\n\n {{- captured_content -}}\n {%- set has_content = captured_content | trim | length > 0 -%}\n\n {%- if ns.prev_message_type == 'tool_call' and not ns_tr_out.flag -%}\n {{- '<|tool_response>' -}}\n {%- elif not (ns_tr_out.flag and not has_content) -%}\n {{- '<turn|>\\n' -}}\n {%- endif -%}\n {%- endif -%}\n{%- endfor -%}\n\n{%- if add_generation_prompt -%}\n {%- if ns.prev_message_type != 'tool_response' and ns.prev_message_type != 'tool_call' -%}\n {{- '<|turn>model\\n' -}}\n {%- endif -%}\n{%- endif -%}"
|
| 75 |
+
}
|
vmlx-banner.png
ADDED
|