jpacifico commited on
Commit
8394465
·
verified ·
1 Parent(s): 3a83006

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -18
README.md CHANGED
@@ -147,30 +147,34 @@ Chocolatine-14B-Instruct-DPO-v1.2 outperforms its previous versions and its base
147
  ########## First turn ##########
148
  score
149
  model turn
150
- gpt-4o-mini 1 9.2875
151
- Chocolatine-2-14B-Instruct-v2.0.1 1 8.9125
152
- Chocolatine-14B-Instruct-4k-DPO 1 8.6375
153
- Chocolatine-14B-Instruct-DPO-v1.2 1 8.6125
154
- Phi-3.5-mini-instruct 1 8.5250
155
- Chocolatine-3B-Instruct-DPO-v1.2 1 8.3750
156
- phi-4 1 8.3000
157
- Phi-3-medium-4k-instruct 1 8.2250
158
- gpt-3.5-turbo 1 8.1375
159
- Chocolatine-3B-Instruct-DPO-Revised 1 7.9875
160
- Daredevil-8B 1 7.8875
161
- Meta-Llama-3.1-8B-Instruct 1 7.0500
162
- vigostral-7b-chat 1 6.7875
163
- Mistral-7B-Instruct-v0.3 1 6.7500
164
- gemma-2-2b-it 1 6.4500
165
- French-Alpaca-7B-Instruct_beta 1 5.6875
166
- vigogne-2-7b-chat 1 5.6625
 
 
167
 
168
  ########## Second turn ##########
169
  score
170
  model turn
171
  Chocolatine-2-14B-Instruct-v2.0.1 2 9.275000
172
  gpt-4o-mini 2 8.912500
 
173
  Chocolatine-14B-Instruct-DPO-v1.2 2 8.337500
 
174
  phi-4 2 8.131250
175
  Chocolatine-3B-Instruct-DPO-Revised 2 7.937500
176
  Chocolatine-3B-Instruct-DPO-v1.2 2 7.862500
@@ -191,7 +195,9 @@ vigogne-2-7b-chat 2 2.775000
191
  model
192
  gpt-4o-mini 9.100000
193
  Chocolatine-2-14B-Instruct-v2.0.1 9.093750
 
194
  Chocolatine-14B-Instruct-DPO-v1.2 8.475000
 
195
  phi-4 8.215625
196
  Chocolatine-14B-Instruct-4k-DPO 8.187500
197
  Chocolatine-3B-Instruct-DPO-v1.2 8.118750
@@ -212,7 +218,7 @@ vigogne-2-7b-chat 4.218750
212
 
213
  You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
214
 
215
- You can also run Chocolatine using the following code:
216
 
217
  ```python
218
  import transformers
 
147
  ########## First turn ##########
148
  score
149
  model turn
150
+ gpt-4o-mini 1 9.287500
151
+ Chocolatine-2-14B-Instruct-v2.0.1 1 8.912500
152
+ Qwen2.5-14B-Instruct 1 8.887500
153
+ Chocolatine-14B-Instruct-4k-DPO 1 8.637500
154
+ Chocolatine-14B-Instruct-DPO-v1.2 1 8.612500
155
+ Phi-3.5-mini-instruct 1 8.525000
156
+ Chocolatine-3B-Instruct-DPO-v1.2 1 8.375000
157
+ DeepSeek-R1-Distill-Qwen-14B 1 8.375000
158
+ phi-4 1 8.300000
159
+ Phi-3-medium-4k-instruct 1 8.225000
160
+ gpt-3.5-turbo 1 8.137500
161
+ Chocolatine-3B-Instruct-DPO-Revised 1 7.987500
162
+ Daredevil-8B 1 7.887500
163
+ Meta-Llama-3.1-8B-Instruct 1 7.050000
164
+ vigostral-7b-chat 1 6.787500
165
+ Mistral-7B-Instruct-v0.3 1 6.750000
166
+ gemma-2-2b-it 1 6.450000
167
+ French-Alpaca-7B-Instruct_beta 1 5.687500
168
+ vigogne-2-7b-chat 1 5.662500
169
 
170
  ########## Second turn ##########
171
  score
172
  model turn
173
  Chocolatine-2-14B-Instruct-v2.0.1 2 9.275000
174
  gpt-4o-mini 2 8.912500
175
+ Qwen2.5-14B-Instruct 2 8.912500
176
  Chocolatine-14B-Instruct-DPO-v1.2 2 8.337500
177
+ DeepSeek-R1-Distill-Qwen-14B 2 8.200000
178
  phi-4 2 8.131250
179
  Chocolatine-3B-Instruct-DPO-Revised 2 7.937500
180
  Chocolatine-3B-Instruct-DPO-v1.2 2 7.862500
 
195
  model
196
  gpt-4o-mini 9.100000
197
  Chocolatine-2-14B-Instruct-v2.0.1 9.093750
198
+ Qwen2.5-14B-Instruct 8.900000
199
  Chocolatine-14B-Instruct-DPO-v1.2 8.475000
200
+ DeepSeek-R1-Distill-Qwen-14B 8.287500
201
  phi-4 8.215625
202
  Chocolatine-14B-Instruct-4k-DPO 8.187500
203
  Chocolatine-3B-Instruct-DPO-v1.2 8.118750
 
218
 
219
  You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
220
 
221
+ You can also run Chocolatine-2 using the following code:
222
 
223
  ```python
224
  import transformers