wavy-jung committed on
Commit 273bbfc · verified · 1 Parent(s): ad150d6

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -11,7 +11,7 @@ developers: KananaAlpha LLM
  training_regime: bf16 mixed precision
  results: '| mmlu (5-shots) [acc] | kmmlu-direct (5-shots) [exact_match] | haerae (5-shots) [acc_norm] | gsm8k (5-shots) [exact_match_strict] | humaneval (0-shots) [pass@1] | mbpp (3-shots) [pass@1] |
  |------------------------|----------------------------------------|-------------------------------|----------------------------------------|--------------------------------|---------------------------|
- | 56.26 | 45.25 | 76.72 | 53.60 | 53.66 | 53.66 |'
+ | 56.26 | 45.25 | 76.72 | 53.60 | 53.66 | 56.65 |'
  model_summary: Kanana-1.5-2.1b-base is an auto-regressive language model that
  uses an optimized transformer architecture. Kanana-1.5-2.1b-base uses a tokenizer
  with a vocabulary of 128K tokens, and supports sequence length of 32k.
@@ -78,7 +78,7 @@ model-index:
  type: google-research-datasets/mbpp
  metrics:
  - type: pass@1
- value: 53.66
+ value: 56.65
  name: pass@1
  ---
  # Model Card for kakaocorp/kanana-1.5-2.1b-base
@@ -123,7 +123,7 @@ Kanana-1.5-2.1b-base was continuously pretrained from kakaocorp/kanana-essence-2
 
  | mmlu (5-shots) [acc] | kmmlu-direct (5-shots) [exact_match] | haerae (5-shots) [acc_norm] | gsm8k (5-shots) [exact_match_strict] | humaneval (0-shots) [pass@1] | mbpp (3-shots) [pass@1] |
  |------------------------|----------------------------------------|-------------------------------|----------------------------------------|--------------------------------|---------------------------|
- | 56.26 | 45.25 | 76.72 | 53.60 | 53.66 | 53.66 |
+ | 56.26 | 45.25 | 76.72 | 53.60 | 53.66 | 56.65 |
 
  ### Results for Long-Context Tasks
  | context length | ruler_niah_mk_2 [ruler_recall] | ruler_niah_mk_3 [ruler_recall] | ruler_niah_mv [ruler_recall] | json_kv [substring_exact_match] | niah [avg] | avg |