Instructions to use zai-org/GLM-5.2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zai-org/GLM-5.2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="zai-org/GLM-5.2") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("zai-org/GLM-5.2") model = AutoModelForCausalLM.from_pretrained("zai-org/GLM-5.2") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- HuggingChat
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use zai-org/GLM-5.2 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "zai-org/GLM-5.2" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zai-org/GLM-5.2", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/zai-org/GLM-5.2
- SGLang
How to use zai-org/GLM-5.2 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "zai-org/GLM-5.2" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zai-org/GLM-5.2", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "zai-org/GLM-5.2" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "zai-org/GLM-5.2", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use zai-org/GLM-5.2 with Docker Model Runner:
docker model run hf.co/zai-org/GLM-5.2
Add files using upload-large-folder tool
Browse filesThis view is limited to 50 files because it contains too many changes. See raw diff
- .gitattributes +1 -0
- chat_template.jinja +119 -0
- config.json +223 -0
- generation_config.json +12 -0
- model-00001-of-00282.safetensors +3 -0
- model-00002-of-00282.safetensors +3 -0
- model-00003-of-00282.safetensors +3 -0
- model-00004-of-00282.safetensors +3 -0
- model-00005-of-00282.safetensors +3 -0
- model-00006-of-00282.safetensors +3 -0
- model-00007-of-00282.safetensors +3 -0
- model-00008-of-00282.safetensors +3 -0
- model-00009-of-00282.safetensors +3 -0
- model-00010-of-00282.safetensors +3 -0
- model-00011-of-00282.safetensors +3 -0
- model-00012-of-00282.safetensors +3 -0
- model-00013-of-00282.safetensors +3 -0
- model-00014-of-00282.safetensors +3 -0
- model-00015-of-00282.safetensors +3 -0
- model-00016-of-00282.safetensors +3 -0
- model-00024-of-00282.safetensors +3 -0
- model-00031-of-00282.safetensors +3 -0
- model-00257-of-00282.safetensors +3 -0
- model-00258-of-00282.safetensors +3 -0
- model-00259-of-00282.safetensors +3 -0
- model-00260-of-00282.safetensors +3 -0
- model-00261-of-00282.safetensors +3 -0
- model-00262-of-00282.safetensors +3 -0
- model-00263-of-00282.safetensors +3 -0
- model-00264-of-00282.safetensors +3 -0
- model-00265-of-00282.safetensors +3 -0
- model-00266-of-00282.safetensors +3 -0
- model-00267-of-00282.safetensors +3 -0
- model-00268-of-00282.safetensors +3 -0
- model-00269-of-00282.safetensors +3 -0
- model-00270-of-00282.safetensors +3 -0
- model-00271-of-00282.safetensors +3 -0
- model-00272-of-00282.safetensors +3 -0
- model-00273-of-00282.safetensors +3 -0
- model-00274-of-00282.safetensors +3 -0
- model-00275-of-00282.safetensors +3 -0
- model-00276-of-00282.safetensors +3 -0
- model-00277-of-00282.safetensors +3 -0
- model-00278-of-00282.safetensors +3 -0
- model-00279-of-00282.safetensors +3 -0
- model-00280-of-00282.safetensors +3 -0
- model-00281-of-00282.safetensors +3 -0
- model-00282-of-00282.safetensors +3 -0
- model.safetensors.index.json +0 -0
- tokenizer.json +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
chat_template.jinja
ADDED
|
@@ -0,0 +1,119 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
[gMASK]<sop>
|
| 2 |
+
{%- set effective_reasoning_effort = 'high' if reasoning_effort is defined and reasoning_effort == 'high' else 'max' -%}
|
| 3 |
+
{%- if (enable_thinking is not defined or enable_thinking) and effective_reasoning_effort is not none -%}<|system|>Reasoning Effort: {{ effective_reasoning_effort | capitalize }}{%- endif -%}
|
| 4 |
+
{%- if tools -%}
|
| 5 |
+
{%- macro tool_to_json(tool) -%}
|
| 6 |
+
{%- set ns_tool = namespace(first=true) -%}
|
| 7 |
+
{{ '{' -}}
|
| 8 |
+
{%- for k, v in tool.items() -%}
|
| 9 |
+
{%- if k != 'defer_loading' and k != 'strict' -%}
|
| 10 |
+
{%- if not ns_tool.first -%}{{- ', ' -}}{%- endif -%}
|
| 11 |
+
{%- set ns_tool.first = false -%}
|
| 12 |
+
"{{ k }}": {{ v | tojson(ensure_ascii=False) }}
|
| 13 |
+
{%- endif -%}
|
| 14 |
+
{%- endfor -%}
|
| 15 |
+
{{- '}' -}}
|
| 16 |
+
{%- endmacro -%}
|
| 17 |
+
<|system|>
|
| 18 |
+
# Tools
|
| 19 |
+
|
| 20 |
+
You may call one or more functions to assist with the user query.
|
| 21 |
+
|
| 22 |
+
You are provided with function signatures within <tools></tools> XML tags:
|
| 23 |
+
<tools>
|
| 24 |
+
{% for tool in tools %}
|
| 25 |
+
{%- if 'function' in tool -%}
|
| 26 |
+
{%- set tool = tool['function'] -%}
|
| 27 |
+
{%- endif -%}
|
| 28 |
+
{% if tool.defer_loading is not defined or not tool.defer_loading %}
|
| 29 |
+
{{ tool_to_json(tool) }}
|
| 30 |
+
{% endif %}
|
| 31 |
+
{% endfor %}
|
| 32 |
+
</tools>
|
| 33 |
+
|
| 34 |
+
For each function call, output the function name and arguments within the following XML format:
|
| 35 |
+
<tool_call>{function-name}<arg_key>{arg-key-1}</arg_key><arg_value>{arg-value-1}</arg_value><arg_key>{arg-key-2}</arg_key><arg_value>{arg-value-2}</arg_value>...</tool_call>{%- endif -%}
|
| 36 |
+
{%- macro visible_text(content) -%}
|
| 37 |
+
{%- if content is string -%}
|
| 38 |
+
{{- content }}
|
| 39 |
+
{%- elif content is iterable and content is not mapping -%}
|
| 40 |
+
{%- for item in content -%}
|
| 41 |
+
{%- if item is mapping and item.type == 'text' -%}
|
| 42 |
+
{{- item.text }}
|
| 43 |
+
{%- elif item is string -%}
|
| 44 |
+
{{- item }}
|
| 45 |
+
{%- elif item is mapping and item.type in ['image', 'image_url', 'video', 'video_url', 'audio', 'audio_url', 'input_audio'] -%}
|
| 46 |
+
{%- set media_type = item.type | replace('_url', '') | replace('input_', '') -%}
|
| 47 |
+
{{- "<reminder>You are unable to process this " ~ media_type ~ " because you don't have multi-modal input ability. Try different methods.</reminder>" }}
|
| 48 |
+
{%- endif -%}
|
| 49 |
+
{%- endfor -%}
|
| 50 |
+
{%- else -%}
|
| 51 |
+
{{- content }}
|
| 52 |
+
{%- endif -%}
|
| 53 |
+
{%- endmacro -%}
|
| 54 |
+
{%- set ns = namespace(last_user_index=-1) -%}
|
| 55 |
+
{%- for m in messages %}
|
| 56 |
+
{%- if m.role == 'user' %}
|
| 57 |
+
{%- set ns.last_user_index = loop.index0 -%}
|
| 58 |
+
{%- endif %}
|
| 59 |
+
{%- endfor %}
|
| 60 |
+
{%- for m in messages -%}
|
| 61 |
+
{%- if m.role == 'user' -%}<|user|>{{ visible_text(m.content) }}
|
| 62 |
+
{%- elif m.role == 'assistant' -%}
|
| 63 |
+
<|assistant|>
|
| 64 |
+
{%- set content = visible_text(m.content) %}
|
| 65 |
+
{%- if m.reasoning_content is string %}
|
| 66 |
+
{%- set reasoning_content = m.reasoning_content %}
|
| 67 |
+
{%- elif '</think>' in content %}
|
| 68 |
+
{%- set reasoning_content = content.split('</think>')[0].split('<think>')[-1] %}
|
| 69 |
+
{%- set content = content.split('</think>')[-1] %}
|
| 70 |
+
{%- endif %}
|
| 71 |
+
{%- if ((clear_thinking is defined and not clear_thinking) or loop.index0 > ns.last_user_index) and reasoning_content is defined -%}
|
| 72 |
+
{{ '<think>' + reasoning_content + '</think>'}}
|
| 73 |
+
{%- else -%}
|
| 74 |
+
{{ '<think></think>' }}
|
| 75 |
+
{%- endif -%}
|
| 76 |
+
{%- if content.strip() -%}
|
| 77 |
+
{{ content.strip() }}
|
| 78 |
+
{%- endif -%}
|
| 79 |
+
{% if m.tool_calls %}
|
| 80 |
+
{% for tc in m.tool_calls %}
|
| 81 |
+
{%- if tc.function %}
|
| 82 |
+
{%- set tc = tc.function %}
|
| 83 |
+
{%- endif %}
|
| 84 |
+
{{- '<tool_call>' + tc.name -}}
|
| 85 |
+
{% set _args = tc.arguments %}{% for k, v in _args.items() %}<arg_key>{{ k }}</arg_key><arg_value>{{ v | tojson(ensure_ascii=False) if v is not string else v }}</arg_value>{% endfor %}</tool_call>{% endfor %}
|
| 86 |
+
{% endif %}
|
| 87 |
+
{%- elif m.role == 'tool' -%}
|
| 88 |
+
{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
|
| 89 |
+
{{- '<|observation|>' -}}
|
| 90 |
+
{%- endif %}
|
| 91 |
+
{%- if m.content is string -%}
|
| 92 |
+
{{- '<tool_response>' + m.content + '</tool_response>' -}}
|
| 93 |
+
{%- elif m.content is iterable and m.content is not mapping and m.content and m.content.0.type == "tool_reference" -%}
|
| 94 |
+
{{- '<tool_response><tools>\n' -}}
|
| 95 |
+
{% for tr in m.content %}
|
| 96 |
+
{%- for tool in tools -%}
|
| 97 |
+
{%- if 'function' in tool -%}
|
| 98 |
+
{%- set tool = tool['function'] -%}
|
| 99 |
+
{%- endif -%}
|
| 100 |
+
{%- if tool.name == tr.name -%}
|
| 101 |
+
{{- tool_to_json(tool) + '\n' -}}
|
| 102 |
+
{%- endif -%}
|
| 103 |
+
{%- endfor -%}
|
| 104 |
+
{%- endfor -%}
|
| 105 |
+
{{- '</tools></tool_response>' -}}
|
| 106 |
+
{%- elif m.content is iterable and m.content is not mapping and m.content and m.content.0 is mapping and m.content.0.output is defined -%}
|
| 107 |
+
{%- for tr in m.content -%}
|
| 108 |
+
{{- '<tool_response>' + tr.output + '</tool_response>' -}}
|
| 109 |
+
{%- endfor -%}
|
| 110 |
+
{%- else -%}
|
| 111 |
+
{{- '<tool_response>' + visible_text(m.content) + '</tool_response>' -}}
|
| 112 |
+
{% endif -%}
|
| 113 |
+
{%- elif m.role == 'system' -%}
|
| 114 |
+
<|system|>{{ visible_text(m.content) }}
|
| 115 |
+
{%- endif -%}
|
| 116 |
+
{%- endfor -%}
|
| 117 |
+
{%- if add_generation_prompt -%}
|
| 118 |
+
<|assistant|>{{- '<think></think>' if (enable_thinking is defined and not enable_thinking) else '<think>' -}}
|
| 119 |
+
{%- endif -%}
|
config.json
ADDED
|
@@ -0,0 +1,223 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"GlmMoeDsaForCausalLM"
|
| 4 |
+
],
|
| 5 |
+
"attention_bias": false,
|
| 6 |
+
"attention_dropout": 0.0,
|
| 7 |
+
"dtype": "bfloat16",
|
| 8 |
+
"eos_token_id": [
|
| 9 |
+
154820,
|
| 10 |
+
154827,
|
| 11 |
+
154829
|
| 12 |
+
],
|
| 13 |
+
"ep_size": 1,
|
| 14 |
+
"first_k_dense_replace": 3,
|
| 15 |
+
"head_dim": 192,
|
| 16 |
+
"hidden_act": "silu",
|
| 17 |
+
"hidden_size": 6144,
|
| 18 |
+
"index_head_dim": 128,
|
| 19 |
+
"index_n_heads": 32,
|
| 20 |
+
"index_share_for_mtp_iteration": true,
|
| 21 |
+
"index_skip_topk_offset": 3,
|
| 22 |
+
"index_topk": 2048,
|
| 23 |
+
"index_topk_freq": 4,
|
| 24 |
+
"index_topk_pattern": null,
|
| 25 |
+
"indexer_rope_interleave": true,
|
| 26 |
+
"indexer_types": [
|
| 27 |
+
"full",
|
| 28 |
+
"full",
|
| 29 |
+
"full",
|
| 30 |
+
"shared",
|
| 31 |
+
"shared",
|
| 32 |
+
"shared",
|
| 33 |
+
"full",
|
| 34 |
+
"shared",
|
| 35 |
+
"shared",
|
| 36 |
+
"shared",
|
| 37 |
+
"full",
|
| 38 |
+
"shared",
|
| 39 |
+
"shared",
|
| 40 |
+
"shared",
|
| 41 |
+
"full",
|
| 42 |
+
"shared",
|
| 43 |
+
"shared",
|
| 44 |
+
"shared",
|
| 45 |
+
"full",
|
| 46 |
+
"shared",
|
| 47 |
+
"shared",
|
| 48 |
+
"shared",
|
| 49 |
+
"full",
|
| 50 |
+
"shared",
|
| 51 |
+
"shared",
|
| 52 |
+
"shared",
|
| 53 |
+
"full",
|
| 54 |
+
"shared",
|
| 55 |
+
"shared",
|
| 56 |
+
"shared",
|
| 57 |
+
"full",
|
| 58 |
+
"shared",
|
| 59 |
+
"shared",
|
| 60 |
+
"shared",
|
| 61 |
+
"full",
|
| 62 |
+
"shared",
|
| 63 |
+
"shared",
|
| 64 |
+
"shared",
|
| 65 |
+
"full",
|
| 66 |
+
"shared",
|
| 67 |
+
"shared",
|
| 68 |
+
"shared",
|
| 69 |
+
"full",
|
| 70 |
+
"shared",
|
| 71 |
+
"shared",
|
| 72 |
+
"shared",
|
| 73 |
+
"full",
|
| 74 |
+
"shared",
|
| 75 |
+
"shared",
|
| 76 |
+
"shared",
|
| 77 |
+
"full",
|
| 78 |
+
"shared",
|
| 79 |
+
"shared",
|
| 80 |
+
"shared",
|
| 81 |
+
"full",
|
| 82 |
+
"shared",
|
| 83 |
+
"shared",
|
| 84 |
+
"shared",
|
| 85 |
+
"full",
|
| 86 |
+
"shared",
|
| 87 |
+
"shared",
|
| 88 |
+
"shared",
|
| 89 |
+
"full",
|
| 90 |
+
"shared",
|
| 91 |
+
"shared",
|
| 92 |
+
"shared",
|
| 93 |
+
"full",
|
| 94 |
+
"shared",
|
| 95 |
+
"shared",
|
| 96 |
+
"shared",
|
| 97 |
+
"full",
|
| 98 |
+
"shared",
|
| 99 |
+
"shared",
|
| 100 |
+
"shared",
|
| 101 |
+
"full",
|
| 102 |
+
"shared",
|
| 103 |
+
"shared",
|
| 104 |
+
"shared"
|
| 105 |
+
],
|
| 106 |
+
"initializer_range": 0.02,
|
| 107 |
+
"intermediate_size": 12288,
|
| 108 |
+
"kv_lora_rank": 512,
|
| 109 |
+
"max_position_embeddings": 1048576,
|
| 110 |
+
"mlp_layer_types": [
|
| 111 |
+
"dense",
|
| 112 |
+
"dense",
|
| 113 |
+
"dense",
|
| 114 |
+
"sparse",
|
| 115 |
+
"sparse",
|
| 116 |
+
"sparse",
|
| 117 |
+
"sparse",
|
| 118 |
+
"sparse",
|
| 119 |
+
"sparse",
|
| 120 |
+
"sparse",
|
| 121 |
+
"sparse",
|
| 122 |
+
"sparse",
|
| 123 |
+
"sparse",
|
| 124 |
+
"sparse",
|
| 125 |
+
"sparse",
|
| 126 |
+
"sparse",
|
| 127 |
+
"sparse",
|
| 128 |
+
"sparse",
|
| 129 |
+
"sparse",
|
| 130 |
+
"sparse",
|
| 131 |
+
"sparse",
|
| 132 |
+
"sparse",
|
| 133 |
+
"sparse",
|
| 134 |
+
"sparse",
|
| 135 |
+
"sparse",
|
| 136 |
+
"sparse",
|
| 137 |
+
"sparse",
|
| 138 |
+
"sparse",
|
| 139 |
+
"sparse",
|
| 140 |
+
"sparse",
|
| 141 |
+
"sparse",
|
| 142 |
+
"sparse",
|
| 143 |
+
"sparse",
|
| 144 |
+
"sparse",
|
| 145 |
+
"sparse",
|
| 146 |
+
"sparse",
|
| 147 |
+
"sparse",
|
| 148 |
+
"sparse",
|
| 149 |
+
"sparse",
|
| 150 |
+
"sparse",
|
| 151 |
+
"sparse",
|
| 152 |
+
"sparse",
|
| 153 |
+
"sparse",
|
| 154 |
+
"sparse",
|
| 155 |
+
"sparse",
|
| 156 |
+
"sparse",
|
| 157 |
+
"sparse",
|
| 158 |
+
"sparse",
|
| 159 |
+
"sparse",
|
| 160 |
+
"sparse",
|
| 161 |
+
"sparse",
|
| 162 |
+
"sparse",
|
| 163 |
+
"sparse",
|
| 164 |
+
"sparse",
|
| 165 |
+
"sparse",
|
| 166 |
+
"sparse",
|
| 167 |
+
"sparse",
|
| 168 |
+
"sparse",
|
| 169 |
+
"sparse",
|
| 170 |
+
"sparse",
|
| 171 |
+
"sparse",
|
| 172 |
+
"sparse",
|
| 173 |
+
"sparse",
|
| 174 |
+
"sparse",
|
| 175 |
+
"sparse",
|
| 176 |
+
"sparse",
|
| 177 |
+
"sparse",
|
| 178 |
+
"sparse",
|
| 179 |
+
"sparse",
|
| 180 |
+
"sparse",
|
| 181 |
+
"sparse",
|
| 182 |
+
"sparse",
|
| 183 |
+
"sparse",
|
| 184 |
+
"sparse",
|
| 185 |
+
"sparse",
|
| 186 |
+
"sparse",
|
| 187 |
+
"sparse",
|
| 188 |
+
"sparse"
|
| 189 |
+
],
|
| 190 |
+
"model_type": "glm_moe_dsa",
|
| 191 |
+
"moe_intermediate_size": 2048,
|
| 192 |
+
"moe_layer_freq": 1,
|
| 193 |
+
"n_group": 1,
|
| 194 |
+
"n_routed_experts": 256,
|
| 195 |
+
"n_shared_experts": 1,
|
| 196 |
+
"norm_topk_prob": true,
|
| 197 |
+
"num_attention_heads": 64,
|
| 198 |
+
"num_experts_per_tok": 8,
|
| 199 |
+
"num_hidden_layers": 78,
|
| 200 |
+
"num_key_value_heads": 64,
|
| 201 |
+
"num_nextn_predict_layers": 1,
|
| 202 |
+
"pad_token_id": 154820,
|
| 203 |
+
"pretraining_tp": 1,
|
| 204 |
+
"q_lora_rank": 2048,
|
| 205 |
+
"qk_head_dim": 256,
|
| 206 |
+
"qk_nope_head_dim": 192,
|
| 207 |
+
"qk_rope_head_dim": 64,
|
| 208 |
+
"rms_norm_eps": 1e-05,
|
| 209 |
+
"rope_interleave": true,
|
| 210 |
+
"rope_parameters": {
|
| 211 |
+
"rope_theta": 8000000,
|
| 212 |
+
"rope_type": "default"
|
| 213 |
+
},
|
| 214 |
+
"routed_scaling_factor": 2.5,
|
| 215 |
+
"scoring_func": "sigmoid",
|
| 216 |
+
"tie_word_embeddings": false,
|
| 217 |
+
"topk_group": 1,
|
| 218 |
+
"topk_method": "noaux_tc",
|
| 219 |
+
"transformers_version": "5.12.0",
|
| 220 |
+
"use_cache": true,
|
| 221 |
+
"v_head_dim": 256,
|
| 222 |
+
"vocab_size": 154880
|
| 223 |
+
}
|
generation_config.json
ADDED
|
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_from_model_config": true,
|
| 3 |
+
"eos_token_id": [
|
| 4 |
+
154820,
|
| 5 |
+
154827,
|
| 6 |
+
154829
|
| 7 |
+
],
|
| 8 |
+
"pad_token_id": 154820,
|
| 9 |
+
"temperature": 1.0,
|
| 10 |
+
"top_p": 0.95,
|
| 11 |
+
"transformers_version": "5.12.0"
|
| 12 |
+
}
|
model-00001-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:004bf9404964da8ea71ea2d3ebf02148fa766b956bd4fca3f54b093e58a6a74c
|
| 3 |
+
size 5342821416
|
model-00002-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7682416ae5caa51698994d71d011bf308016f97bc33fe2f8187f37cd73b85e39
|
| 3 |
+
size 5351970840
|
model-00003-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3427fbf78d22254a65e9ea918ab724ce9dbb0f9231c7054fbb3b7a65e132d13c
|
| 3 |
+
size 5360347320
|
model-00004-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c5cc75d06e0f5a693a21142594dda258dc46eb46ae14e33ab66310a2fd664c46
|
| 3 |
+
size 5360347208
|
model-00005-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7e5461ed71639e7496b8f4ebb71e4faffec78fdf97eeea424f8b0d06f1747402
|
| 3 |
+
size 5359985352
|
model-00006-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c4225900bd3fb7de29a27f0c613680a8d38c7571753560837878115c58f06715
|
| 3 |
+
size 5360347320
|
model-00007-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ba48cc4205a98fd1f6db7b62941439a7497346e3cb8e77f9e77f04afc094ec03
|
| 3 |
+
size 5360347320
|
model-00008-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2f2bf12168f93157a568ef0a5e7e2e2ed107c4d6f08da234bdaccb20f69dfd50
|
| 3 |
+
size 5360347144
|
model-00009-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b98cbef0349b6ce166add44223d299c315898f3a62d8abb2cdd6e7e2ea87f99d
|
| 3 |
+
size 5366406944
|
model-00010-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dd95904337bc2f7afb5815c4ec8723cadd7d2899c3ffae39c06b95c09add7088
|
| 3 |
+
size 5360347312
|
model-00011-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b25431bc8f47afa28e7c54e7dd5c602124ba6d29eb29c1ae85803f3a2a8f4c5c
|
| 3 |
+
size 5360347288
|
model-00012-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2b547be1d742f78230e1358808ad26b373ed6855017cb8230dc80f503d49a91e
|
| 3 |
+
size 5363507640
|
model-00013-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:0d6007e4f7e0eb084782bf2cccdd5d2d143d41939b033439e9fe3525abe85b47
|
| 3 |
+
size 5363246488
|
model-00014-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6af3afe8e57aa38d4765ed60b3d9d3db17ac3b9edcb1221a9e3803c26186a070
|
| 3 |
+
size 5360347312
|
model-00015-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8973e20da1e7a417d7a5c1ca937c679411e15d30e6b1c681bb48e8b1b67ec48a
|
| 3 |
+
size 5360347224
|
model-00016-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d2933dc4032c2248ca122be4cb1432dd8dd553d9967a6a515a82485662e92f99
|
| 3 |
+
size 5366406864
|
model-00024-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:622a574bb87ed480e8a7c6f5d97d96c2e746ebd9d65fd3300a5b5f06078dc630
|
| 3 |
+
size 5366407008
|
model-00031-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b6fd157724a0a1c675877cb56885a4cb0ca015d17e93ce075ece80aed7173584
|
| 3 |
+
size 5366406928
|
model-00257-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:695b48bf60615b14e3aadcf372da249b7069068b00c32854d45a5527b6ccc72c
|
| 3 |
+
size 5360347312
|
model-00258-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:45c039e78bf83337731a538eae15fb840e97a704695d73421e6b7413d8534e08
|
| 3 |
+
size 5360347256
|
model-00259-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4cccb92772dc1727cda0c91244a7cea82b0437e6308da562f9e44a02c21fd5f6
|
| 3 |
+
size 5359985296
|
model-00260-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fe4937f02ceb3cd40ceed1eaf6bda7730e1c90f4d9f15af3b1381cd819ec9ee9
|
| 3 |
+
size 5360347320
|
model-00261-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6f98cd9414f850c35420da6ffc038257cf519e0dc2f1f493f2ee892cc1efcd09
|
| 3 |
+
size 5360347312
|
model-00262-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d59cb92290d524e497cd58f49b868769931e63eccf13161867b5be8c1b8fe61f
|
| 3 |
+
size 5360347192
|
model-00263-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:8073e008e49f00eebf3cc88ad7e37491ae157b1f77f24afdb944256f4aa92d79
|
| 3 |
+
size 5366406896
|
model-00264-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d695bc81d74a3e317ed396c45f1f50ba4b295f8583cc2bf123bc64e7eb782203
|
| 3 |
+
size 5360347320
|
model-00265-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:df91d6dc3076988840643160da0b036b6acbbb6884fa9699f368fc3546696c3c
|
| 3 |
+
size 5360347312
|
model-00266-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5091513ae3a8c315c2f8e4afb20c40e2c4364b1c1e6e908e21f132f6d7578625
|
| 3 |
+
size 5360347128
|
model-00267-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:72531de28c8221463eea9f82c1e2c6bc6fb6e2b5c89c21e3237dc707516bc7ba
|
| 3 |
+
size 5366406960
|
model-00268-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:db6bfce751a1a56ae17856631552fb4f83821c300822a909da48267f5d82a976
|
| 3 |
+
size 5360347320
|
model-00269-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:49d6a9e4c256bc01d18fac9dd8d68177f1852a4b0e9d5f7b8d7971f9a3075652
|
| 3 |
+
size 5360347264
|
model-00270-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d74106256f061e73000e9660d157bd22254d2a5692cf9466d76dfea6985c0924
|
| 3 |
+
size 5366430968
|
model-00271-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:90ba74c758309888b9d3f17adc189e32e77cee77ca8f7892ad12a9b38956cd43
|
| 3 |
+
size 5360347304
|
model-00272-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d5c9dbfba6aff2be069079cf39c8991393a2b69469a4f0ac4e246af25d519a06
|
| 3 |
+
size 5360347320
|
model-00273-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1344c75f27e5564baa46641ebcdf19a2a13ac1b2cfc52e01744b6dda1127aa94
|
| 3 |
+
size 5360347208
|
model-00274-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1943b335a5aa626389e819fa5a7339c361844c425389b35328eba0142935fbf6
|
| 3 |
+
size 5359997688
|
model-00275-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f1177dcf412931a0fcb5803f7028e7b4de7bf56462d2c650649d58e884e9cf67
|
| 3 |
+
size 5360347104
|
model-00276-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:34323416e94255ac4b0fc0f0de22a0bc05b4e7c0c0c9fb95ea69e7a5f1c34ad4
|
| 3 |
+
size 5360347104
|
model-00277-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1fc3feb8bea3dc74d7242f4fbd6d5738376de70e1f6731f1d84125dee95fbc1f
|
| 3 |
+
size 5360346928
|
model-00278-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:da33beb05c67106489010343adad8fbaa9d56e258e26aa88c30af1c37d2e3fb3
|
| 3 |
+
size 5366406736
|
model-00279-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:24f1c153e5490c2b0669d73fb93571cec783ec00056fc733b8b2b1078c169b2c
|
| 3 |
+
size 5360347104
|
model-00280-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c41e9b458eee29b83dab6bef0209f85713d53d05a7f0af768b5db4c0910948bc
|
| 3 |
+
size 5360347072
|
model-00281-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1e95cc485333cd0feb737360716b9398d5a23620a1d4127462403b3decba19cb
|
| 3 |
+
size 5324447848
|
model-00282-of-00282.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:46c23d3a25db83ab8b7e47d0a5efec49e00f57a815358323f1ab45136de327d7
|
| 3 |
+
size 293618224
|
model.safetensors.index.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:19e773648cb4e65de8660ea6365e10acca112d42a854923df93db4a6f333a82d
|
| 3 |
+
size 20217442
|