Instructions to use Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128")
model = AutoModelForCausalLM.from_pretrained("Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128

SGLang

How to use Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128 with Docker Model Runner:
```
docker model run hf.co/Hyun9junn/K-EXAONE-236B-A23B-W4A16-G128
```

Hyun9junn commited on Apr 8

Commit

facee65

verified ·

1 Parent(s): 3ba9c19

Add chat_template.jinja

Browse files

Files changed (1) hide show

chat_template.jinja +159 -0

chat_template.jinja ADDED Viewed

	@@ -0,0 +1,159 @@

+{% set image_count = namespace(value=0) %}
+{% set video_count = namespace(value=0) %}
+{%- set role_indicators = {
+    'user': '<|user|>\n',
+    'assistant': '<|assistant|>\n',
+    'system': '<|system|>\n',
+    'tool': '<|tool|>\n',
+    'tool_declare': '<|tool_declare|>\n'
+} %}
+{%- set end_of_turn = '<|endofturn|>\n' %}
+{%- macro declare_available_tools(tools) %}
+    {{- "# Tools" }}
+    {{- "\n" }}
+    {%- for tool in tools %}
+        {{- "<tool>" }}
+        {{- tool | tojson(ensure_ascii=False) | safe }}
+        {{- "</tool>\n" }}
+    {%- endfor %}
+{%- endmacro %}
+{%- set ns = namespace(last_query_index = messages|length - 1, last_query_index_not_yet_determined = true) %}
+{%- for message in messages[::-1] %}
+    {%- set index = (messages|length - 1) - loop.index0 %}
+    {%- if ns.last_query_index_not_yet_determined and message.role == "user" and message.content is string %}
+        {%- set ns.last_query_index = index -%}
+        {%- set ns.last_query_index_not_yet_determined = false -%}
+    {%- endif %}
+{%- endfor %}
+{%- if tools is defined and tools %}
+    {{- role_indicators['tool_declare'] }}
+    {{- declare_available_tools(tools) }}
+    {{- end_of_turn -}}
+{%- endif %}
+{%- for i in range(messages | length) %}
+    {%- set msg = messages[i] %}
+    {%- set role = msg.role %}
+    {%- if role not in role_indicators %}
+        {{- raise_exception('Unknown role: ' ~ role) }}
+    {%- endif %}
+    {%- if i == 0 %}
+        {%- if role == 'system' %}
+            {{- role_indicators['system'] }}
+            {{- msg.content }}
+            {{- end_of_turn -}}
+            {%- continue %}
+        {%- endif %}
+    {%- endif %}
+    {%- if role == 'assistant' %}
+        {{- role_indicators['assistant'] }}
+        {%- set content = (msg.content if (msg.content is defined and msg.content) else "") -%}
+        {%- set reasoning = none -%}
+        {%- if msg.reasoning_content is defined and msg.reasoning_content%}
+            {%- set reasoning = msg.reasoning_content.strip() -%}
+        {%- elif content and "</think>" in content %}
+            {%- set _parts = content.split('</think>') -%}
+            {%- set reasoning = _parts[0].lstrip('<think>').strip() -%}
+            {%- set content = _parts[-1].strip() -%}
+        {%- endif %}
+        {%- if not (reasoning and i > ns.last_query_index) or (skip_think is defined and skip_think) %}
+            {%- set reasoning = none %}
+        {%- endif %}
+        {%- set content = content.strip() -%}
+        {{- "<think>\n" }}
+        {{- (reasoning if reasoning is not none else "") }}
+        {{- "\n</think>\n\n" }}
+        {{- content }}
+        {%- if msg.tool_calls %}
+            {%- if content is defined and content %}
+                {{- "\n" }}
+            {%- endif %}
+            {%- for tool_call in msg.tool_calls %}
+                {%- if tool_call.function is defined %}
+                    {%- set tool_call = tool_call.function %}
+                {%- endif %}
+                {%- if tool_call.arguments is defined %}
+                    {%- set arguments = tool_call.arguments %}
+                {%- elif tool_call.parameters is defined %}
+                    {%- set arguments = tool_call.parameters %}
+                {%- else %}
+                    {{- raise_exception('arguments or parameters are mandatory: ' ~ tool_call) }}
+                {%- endif %}
+                {%- if arguments is string %}
+                    {{- "<tool_call>" }}{"name": "{{- tool_call.name }}", "arguments": {{ arguments }}}{{- "</tool_call>" }}
+                {%- else %}
+                    {{- "<tool_call>" }}{"name": "{{- tool_call.name }}", "arguments": {{ arguments | tojson(ensure_ascii=False) | safe }}}{{- "</tool_call>" }}
+                {%- endif %}
+                {%- if not loop.last %}
+                    {{- "\n" }}
+                {%- endif %}
+            {%- endfor %}
+        {%- endif %}
+        {{- end_of_turn -}}
+    {%- elif role == "tool" %}
+        {%- if i == 0 or messages[i - 1].role != "tool" %}
+            {{- role_indicators['tool'] }}
+        {%- endif %}
+        {%- if msg.content is defined %}
+            {%- if msg.content is string %}
+                {{- "<tool_result>" }}{{ msg.content }}{{- "</tool_result>" }}
+            {%- else %}
+                {{- "<tool_result>" }}{{ msg.content | tojson(ensure_ascii=False) | safe }}{{- "</tool_result>" }}
+            {%- endif %}
+        {%- endif %}
+        {%- if loop.last or messages[i + 1].role != "tool" %}
+            {{- end_of_turn -}}
+        {%- else %}
+            {{- "\n" }}
+        {%- endif %}
+    {%- else %}
+        {{- role_indicators[role] }}
+        {%- if msg.content is string %}
+            {{- msg.content }}
+        {%- else %}
+            {%- for content in msg.content %}
+                {%- if content.type == 'image' %}
+                    {%- set image_count.value = image_count.value + 1 %}
+                    {%- if add_vision_id %}Picture {{ image_count.value }}: {% endif %}<vision><image_pad></vision>
+                {%- elif content.type == 'video' %}
+                    {%- set video_count.value = video_count.value + 1 %}
+                    {%- if add_vision_id %}Video {{ video_count.value }}: {% endif %}<vision><video_pad></vision>
+                {%- elif content.type == 'text' %}
+                    {{- content.text }}
+                {%- else %}
+                    {{- content.text }}
+                {%- endif %}
+            {%- endfor %}
+        {%- endif %}
+        {{- end_of_turn -}}
+    {%- endif %}
+{% endfor %}
+{%- if add_generation_prompt %}
+    {{- role_indicators['assistant'] }}
+    {%- if enable_thinking is not defined or enable_thinking is true %}
+        {{- "<think>\n" }}
+    {%- else %}
+        {{- "<think>\n\n</think>\n\n" }}
+    {%- endif %}
+{%- endif %}