DimensionSTP commited on
Commit
cf1ad88
Β·
1 Parent(s): 462ccd7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +134 -0
README.md ADDED
@@ -0,0 +1,134 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ language:
4
+ - ko
5
+ - en
6
+ tags:
7
+ - korean
8
+ - reasoning
9
+ - instruction-tuning
10
+ - fine-tuning
11
+ - kanana
12
+ - llama
13
+ - sft
14
+ ---
15
+
16
+ # 🧠 kanana-nano-2.1b-instruct-Ko-Reasoning
17
+
18
+ > A large-scale Korean reasoning model fine-tuned from **kakaocorp/kanana-nano-2.1b-instruct**, designed to excel in logical and multi-hop reasoning tasks in Korean.
19
+
20
+ ---
21
+
22
+ ## πŸ“Œ Overview
23
+
24
+ **kanana-nano-2.1b-instruct-Ko-Reasoning** is a fine-tuned version of [kakaocorp/kanana-nano-2.1b-instruct](https://huggingface.co/kakaocorp/kanana-nano-2.1b-instruct), specifically optimized for **logical reasoning in Korean**. This model is part of a broader research initiative to explore:
25
+
26
+ - The **transition from multilingual reasoning LLMs** to **Korean-specialized reasoning models**
27
+ - The enhancement of **non-reasoning Korean language models** into **reasoning-capable variants**
28
+ - The development of open-access models that rival proprietary alternatives in complex reasoning tasks
29
+
30
+ This model was fine-tuned using a large-scale Korean-English instruction dataset containing diverse multi-hop questions, symbolic logic tasks, and human-crafted reasoning steps.
31
+
32
+ ---
33
+
34
+ ## πŸ§ͺ Benchmark Results
35
+
36
+ > - πŸ“Š All benchmarks were measured using the **0-shot CoT (Chain-of-Thought)** method.
37
+ > - πŸ“Š The **Score** represents either the **accuracy (%)** of correct answers or a rating on a **1-10 scale** from a judge model.
38
+
39
+ | **Benchmark** | **Score** |
40
+ |------------------|---------------|
41
+ | GPQA diamond | 44.3 |
42
+ | GSM8K | 54.1 |
43
+ | HAERAE | 50.3 |
44
+ | KSM | 35.2 |
45
+ | Math500 | 66.9 |
46
+
47
+ ---
48
+
49
+ ## πŸ§‘β€πŸ’» Usage
50
+
51
+ Install Transformers >= 4.50:
52
+
53
+ ```bash
54
+ pip install -U transformers
55
+ ```
56
+
57
+ Basic example:
58
+
59
+ ```python
60
+ from transformers import AutoModelForCausalLM, AutoTokenizer
61
+
62
+ model_name = "DimensionSTP/kanana-nano-2.1b-instruct-Ko-Reasoning"
63
+
64
+ model = AutoModelForCausalLM.from_pretrained(
65
+ model_name,
66
+ torch_dtype="auto",
67
+ device_map="auto"
68
+ )
69
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
70
+
71
+ prompt = "μ„œμšΈκ³Ό λΆ€μ‚° 쀑 μ–΄λ””κ°€ 더 컀?"
72
+ messages = [
73
+ {"role": "user", "content": prompt}
74
+ ]
75
+ text = tokenizer.apply_chat_template(
76
+ messages,
77
+ tokenize=False,
78
+ add_generation_prompt=True
79
+ )
80
+
81
+ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
82
+
83
+ generated_ids = model.generate(
84
+ **model_inputs,
85
+ max_new_tokens=4096
86
+ )
87
+ generated_ids = [
88
+ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
89
+ ]
90
+
91
+ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
92
+ print(response)
93
+ ```
94
+
95
+ ---
96
+
97
+ ## 🧠 Base Model: kakaocorp/kanana-nano-2.1b-instruct
98
+
99
+ The base model, [kakaocorp/kanana-nano-2.1b-instruct](https://huggingface.co/kakaocorp/kanana-nano-2.1b-instruct), is a LLM developed by the Kakao Kanana team.
100
+ For more technical details, refer to the [Kanana Technical Report](https://arxiv.org/pdf/2502.18934).
101
+
102
+ ---
103
+
104
+ ## 🧱 Model Architecture
105
+
106
+ | Property | Value |
107
+ |------------------|------------------------|
108
+ | Architecture | LlamaForCausalLM |
109
+ | Parameters | 2.1B |
110
+ | Context Length | 8,192 tokens |
111
+ | Tokenizer | LlamaTokenizer (BPE) |
112
+
113
+ ---
114
+
115
+ ## πŸ“… Release Date
116
+
117
+ **Mar 2025**
118
+ This model was released in March 2025 as part of the **Ko-Reasoning Series**, which focuses on pushing the boundaries of open-source reasoning in Korean using modern LLMs.
119
+
120
+ ---
121
+
122
+ ## πŸ“¬ Contact
123
+
124
+ For questions, collaborations, or deployment inquiries, please contact:
125
+
126
+ - πŸ€– Hugging Face: [https://huggingface.co/DimensionSTP](https://huggingface.co/DimensionSTP)
127
+ - βœ‰οΈ Email: [ddang8jh@gmail.com]
128
+
129
+ ---
130
+
131
+ ## πŸ“¦ Available Checkpoints
132
+
133
+ - βœ… `main`: Final stable version from the `last` branch
134
+ - βœ… All training artifacts available (tokenizer, config, model weights)