6Morpheus6 commited on
Commit
fb8e32c
·
verified ·
1 Parent(s): ddc7e24

Upload 4 files

Browse files
Files changed (3) hide show
  1. README.md +2601 -0
  2. adapter_config.json +35 -0
  3. adapter_model.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,2601 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: peft
3
+ license: mit
4
+ language:
5
+ - en
6
+ pipeline_tag: sentence-similarity
7
+ tags:
8
+ - text-embedding
9
+ - embeddings
10
+ - information-retrieval
11
+ - beir
12
+ - text-classification
13
+ - language-model
14
+ - text-clustering
15
+ - text-semantic-similarity
16
+ - text-evaluation
17
+ - text-reranking
18
+ - feature-extraction
19
+ - sentence-similarity
20
+ - Sentence Similarity
21
+ - natural_questions
22
+ - ms_marco
23
+ - fever
24
+ - hotpot_qa
25
+ - mteb
26
+ model-index:
27
+ - name: LLM2Vec-Meta-Llama-3-supervised
28
+ results:
29
+ - task:
30
+ type: Classification
31
+ dataset:
32
+ type: mteb/amazon_counterfactual
33
+ name: MTEB AmazonCounterfactualClassification (en)
34
+ config: en
35
+ split: test
36
+ revision: e8379541af4e31359cca9fbcf4b00f2671dba205
37
+ metrics:
38
+ - type: accuracy
39
+ value: 79.94029850746269
40
+ - type: ap
41
+ value: 44.93223506764482
42
+ - type: f1
43
+ value: 74.30328994013465
44
+ - task:
45
+ type: Classification
46
+ dataset:
47
+ type: mteb/amazon_polarity
48
+ name: MTEB AmazonPolarityClassification
49
+ config: default
50
+ split: test
51
+ revision: e2d317d38cd51312af73b3d32a06d1a08b442046
52
+ metrics:
53
+ - type: accuracy
54
+ value: 86.06680000000001
55
+ - type: ap
56
+ value: 81.97124658709345
57
+ - type: f1
58
+ value: 86.00558036874241
59
+ - task:
60
+ type: Classification
61
+ dataset:
62
+ type: mteb/amazon_reviews_multi
63
+ name: MTEB AmazonReviewsClassification (en)
64
+ config: en
65
+ split: test
66
+ revision: 1399c76144fd37290681b995c656ef9b2e06e26d
67
+ metrics:
68
+ - type: accuracy
69
+ value: 46.836
70
+ - type: f1
71
+ value: 46.05094679201488
72
+ - task:
73
+ type: Retrieval
74
+ dataset:
75
+ type: arguana
76
+ name: MTEB ArguAna
77
+ config: default
78
+ split: test
79
+ revision: None
80
+ metrics:
81
+ - type: map_at_1
82
+ value: 37.980000000000004
83
+ - type: map_at_10
84
+ value: 54.167
85
+ - type: map_at_100
86
+ value: 54.735
87
+ - type: map_at_1000
88
+ value: 54.738
89
+ - type: map_at_3
90
+ value: 49.384
91
+ - type: map_at_5
92
+ value: 52.285000000000004
93
+ - type: mrr_at_1
94
+ value: 38.549
95
+ - type: mrr_at_10
96
+ value: 54.351000000000006
97
+ - type: mrr_at_100
98
+ value: 54.932
99
+ - type: mrr_at_1000
100
+ value: 54.935
101
+ - type: mrr_at_3
102
+ value: 49.585
103
+ - type: mrr_at_5
104
+ value: 52.469
105
+ - type: ndcg_at_1
106
+ value: 37.980000000000004
107
+ - type: ndcg_at_10
108
+ value: 62.778999999999996
109
+ - type: ndcg_at_100
110
+ value: 64.986
111
+ - type: ndcg_at_1000
112
+ value: 65.036
113
+ - type: ndcg_at_3
114
+ value: 53.086999999999996
115
+ - type: ndcg_at_5
116
+ value: 58.263
117
+ - type: precision_at_1
118
+ value: 37.980000000000004
119
+ - type: precision_at_10
120
+ value: 9.011
121
+ - type: precision_at_100
122
+ value: 0.993
123
+ - type: precision_at_1000
124
+ value: 0.1
125
+ - type: precision_at_3
126
+ value: 21.266
127
+ - type: precision_at_5
128
+ value: 15.248999999999999
129
+ - type: recall_at_1
130
+ value: 37.980000000000004
131
+ - type: recall_at_10
132
+ value: 90.114
133
+ - type: recall_at_100
134
+ value: 99.289
135
+ - type: recall_at_1000
136
+ value: 99.644
137
+ - type: recall_at_3
138
+ value: 63.798
139
+ - type: recall_at_5
140
+ value: 76.24499999999999
141
+ - task:
142
+ type: Clustering
143
+ dataset:
144
+ type: mteb/arxiv-clustering-p2p
145
+ name: MTEB ArxivClusteringP2P
146
+ config: default
147
+ split: test
148
+ revision: a122ad7f3f0291bf49cc6f4d32aa80929df69d5d
149
+ metrics:
150
+ - type: v_measure
151
+ value: 44.27081216556421
152
+ - task:
153
+ type: Clustering
154
+ dataset:
155
+ type: mteb/arxiv-clustering-s2s
156
+ name: MTEB ArxivClusteringS2S
157
+ config: default
158
+ split: test
159
+ revision: f910caf1a6075f7329cdf8c1a6135696f37dbd53
160
+ metrics:
161
+ - type: v_measure
162
+ value: 46.8490872532913
163
+ - task:
164
+ type: Reranking
165
+ dataset:
166
+ type: mteb/askubuntudupquestions-reranking
167
+ name: MTEB AskUbuntuDupQuestions
168
+ config: default
169
+ split: test
170
+ revision: 2000358ca161889fa9c082cb41daa8dcfb161a54
171
+ metrics:
172
+ - type: map
173
+ value: 65.18525400430678
174
+ - type: mrr
175
+ value: 78.80149936244119
176
+ - task:
177
+ type: STS
178
+ dataset:
179
+ type: mteb/biosses-sts
180
+ name: MTEB BIOSSES
181
+ config: default
182
+ split: test
183
+ revision: d3fb88f8f02e40887cd149695127462bbcf29b4a
184
+ metrics:
185
+ - type: cos_sim_spearman
186
+ value: 84.92301936595548
187
+ - task:
188
+ type: Classification
189
+ dataset:
190
+ type: mteb/banking77
191
+ name: MTEB Banking77Classification
192
+ config: default
193
+ split: test
194
+ revision: 0fd18e25b25c072e09e0d92ab615fda904d66300
195
+ metrics:
196
+ - type: accuracy
197
+ value: 88.0487012987013
198
+ - type: f1
199
+ value: 88.00953788281542
200
+ - task:
201
+ type: Clustering
202
+ dataset:
203
+ type: mteb/biorxiv-clustering-p2p
204
+ name: MTEB BiorxivClusteringP2P
205
+ config: default
206
+ split: test
207
+ revision: 65b79d1d13f80053f67aca9498d9402c2d9f1f40
208
+ metrics:
209
+ - type: v_measure
210
+ value: 32.34687321141145
211
+ - task:
212
+ type: Clustering
213
+ dataset:
214
+ type: mteb/biorxiv-clustering-s2s
215
+ name: MTEB BiorxivClusteringS2S
216
+ config: default
217
+ split: test
218
+ revision: 258694dd0231531bc1fd9de6ceb52a0853c6d908
219
+ metrics:
220
+ - type: v_measure
221
+ value: 36.69881680534123
222
+ - task:
223
+ type: Retrieval
224
+ dataset:
225
+ type: cqadupstack/android
226
+ name: MTEB CQADupstackAndroidRetrieval
227
+ config: default
228
+ split: test
229
+ revision: None
230
+ metrics:
231
+ - type: map_at_1
232
+ value: 37.742
233
+ - type: map_at_10
234
+ value: 51.803
235
+ - type: map_at_100
236
+ value: 53.556000000000004
237
+ - type: map_at_1000
238
+ value: 53.652
239
+ - type: map_at_3
240
+ value: 47.286
241
+ - type: map_at_5
242
+ value: 50.126000000000005
243
+ - type: mrr_at_1
244
+ value: 46.924
245
+ - type: mrr_at_10
246
+ value: 57.857
247
+ - type: mrr_at_100
248
+ value: 58.592
249
+ - type: mrr_at_1000
250
+ value: 58.619
251
+ - type: mrr_at_3
252
+ value: 55.340999999999994
253
+ - type: mrr_at_5
254
+ value: 57.150999999999996
255
+ - type: ndcg_at_1
256
+ value: 46.924
257
+ - type: ndcg_at_10
258
+ value: 58.733999999999995
259
+ - type: ndcg_at_100
260
+ value: 63.771
261
+ - type: ndcg_at_1000
262
+ value: 64.934
263
+ - type: ndcg_at_3
264
+ value: 53.189
265
+ - type: ndcg_at_5
266
+ value: 56.381
267
+ - type: precision_at_1
268
+ value: 46.924
269
+ - type: precision_at_10
270
+ value: 11.431
271
+ - type: precision_at_100
272
+ value: 1.73
273
+ - type: precision_at_1000
274
+ value: 0.213
275
+ - type: precision_at_3
276
+ value: 25.942
277
+ - type: precision_at_5
278
+ value: 19.113
279
+ - type: recall_at_1
280
+ value: 37.742
281
+ - type: recall_at_10
282
+ value: 71.34
283
+ - type: recall_at_100
284
+ value: 91.523
285
+ - type: recall_at_1000
286
+ value: 98.494
287
+ - type: recall_at_3
288
+ value: 55.443
289
+ - type: recall_at_5
290
+ value: 64.122
291
+ - task:
292
+ type: Retrieval
293
+ dataset:
294
+ type: cqadupstack/english
295
+ name: MTEB CQADupstackEnglishRetrieval
296
+ config: default
297
+ split: test
298
+ revision: None
299
+ metrics:
300
+ - type: map_at_1
301
+ value: 34.183
302
+ - type: map_at_10
303
+ value: 46.837
304
+ - type: map_at_100
305
+ value: 48.126000000000005
306
+ - type: map_at_1000
307
+ value: 48.25
308
+ - type: map_at_3
309
+ value: 43.171
310
+ - type: map_at_5
311
+ value: 45.318999999999996
312
+ - type: mrr_at_1
313
+ value: 43.376
314
+ - type: mrr_at_10
315
+ value: 52.859
316
+ - type: mrr_at_100
317
+ value: 53.422000000000004
318
+ - type: mrr_at_1000
319
+ value: 53.456
320
+ - type: mrr_at_3
321
+ value: 50.434999999999995
322
+ - type: mrr_at_5
323
+ value: 51.861999999999995
324
+ - type: ndcg_at_1
325
+ value: 43.376
326
+ - type: ndcg_at_10
327
+ value: 53.223
328
+ - type: ndcg_at_100
329
+ value: 57.175
330
+ - type: ndcg_at_1000
331
+ value: 58.86900000000001
332
+ - type: ndcg_at_3
333
+ value: 48.417
334
+ - type: ndcg_at_5
335
+ value: 50.77
336
+ - type: precision_at_1
337
+ value: 43.376
338
+ - type: precision_at_10
339
+ value: 10.236
340
+ - type: precision_at_100
341
+ value: 1.5730000000000002
342
+ - type: precision_at_1000
343
+ value: 0.203
344
+ - type: precision_at_3
345
+ value: 23.97
346
+ - type: precision_at_5
347
+ value: 17.134
348
+ - type: recall_at_1
349
+ value: 34.183
350
+ - type: recall_at_10
351
+ value: 64.866
352
+ - type: recall_at_100
353
+ value: 81.26100000000001
354
+ - type: recall_at_1000
355
+ value: 91.412
356
+ - type: recall_at_3
357
+ value: 50.080000000000005
358
+ - type: recall_at_5
359
+ value: 56.871
360
+ - task:
361
+ type: Retrieval
362
+ dataset:
363
+ type: cqadupstack/gaming
364
+ name: MTEB CQADupstackGamingRetrieval
365
+ config: default
366
+ split: test
367
+ revision: None
368
+ metrics:
369
+ - type: map_at_1
370
+ value: 44.878
371
+ - type: map_at_10
372
+ value: 58.656
373
+ - type: map_at_100
374
+ value: 59.668
375
+ - type: map_at_1000
376
+ value: 59.704
377
+ - type: map_at_3
378
+ value: 54.891
379
+ - type: map_at_5
380
+ value: 57.050999999999995
381
+ - type: mrr_at_1
382
+ value: 51.975
383
+ - type: mrr_at_10
384
+ value: 62.357
385
+ - type: mrr_at_100
386
+ value: 62.907999999999994
387
+ - type: mrr_at_1000
388
+ value: 62.925
389
+ - type: mrr_at_3
390
+ value: 59.801
391
+ - type: mrr_at_5
392
+ value: 61.278
393
+ - type: ndcg_at_1
394
+ value: 51.975
395
+ - type: ndcg_at_10
396
+ value: 64.95100000000001
397
+ - type: ndcg_at_100
398
+ value: 68.414
399
+ - type: ndcg_at_1000
400
+ value: 69.077
401
+ - type: ndcg_at_3
402
+ value: 58.897999999999996
403
+ - type: ndcg_at_5
404
+ value: 61.866
405
+ - type: precision_at_1
406
+ value: 51.975
407
+ - type: precision_at_10
408
+ value: 10.502
409
+ - type: precision_at_100
410
+ value: 1.31
411
+ - type: precision_at_1000
412
+ value: 0.13899999999999998
413
+ - type: precision_at_3
414
+ value: 26.290000000000003
415
+ - type: precision_at_5
416
+ value: 18.093999999999998
417
+ - type: recall_at_1
418
+ value: 44.878
419
+ - type: recall_at_10
420
+ value: 79.746
421
+ - type: recall_at_100
422
+ value: 94.17
423
+ - type: recall_at_1000
424
+ value: 98.80499999999999
425
+ - type: recall_at_3
426
+ value: 63.70099999999999
427
+ - type: recall_at_5
428
+ value: 70.878
429
+ - task:
430
+ type: Retrieval
431
+ dataset:
432
+ type: cqadupstack/gis
433
+ name: MTEB CQADupstackGisRetrieval
434
+ config: default
435
+ split: test
436
+ revision: None
437
+ metrics:
438
+ - type: map_at_1
439
+ value: 28.807
440
+ - type: map_at_10
441
+ value: 39.431
442
+ - type: map_at_100
443
+ value: 40.56
444
+ - type: map_at_1000
445
+ value: 40.617999999999995
446
+ - type: map_at_3
447
+ value: 36.436
448
+ - type: map_at_5
449
+ value: 37.955
450
+ - type: mrr_at_1
451
+ value: 31.186000000000003
452
+ - type: mrr_at_10
453
+ value: 41.654
454
+ - type: mrr_at_100
455
+ value: 42.58
456
+ - type: mrr_at_1000
457
+ value: 42.623
458
+ - type: mrr_at_3
459
+ value: 38.983000000000004
460
+ - type: mrr_at_5
461
+ value: 40.35
462
+ - type: ndcg_at_1
463
+ value: 31.186000000000003
464
+ - type: ndcg_at_10
465
+ value: 45.297
466
+ - type: ndcg_at_100
467
+ value: 50.515
468
+ - type: ndcg_at_1000
469
+ value: 52.005
470
+ - type: ndcg_at_3
471
+ value: 39.602
472
+ - type: ndcg_at_5
473
+ value: 42.027
474
+ - type: precision_at_1
475
+ value: 31.186000000000003
476
+ - type: precision_at_10
477
+ value: 7.073
478
+ - type: precision_at_100
479
+ value: 1.0210000000000001
480
+ - type: precision_at_1000
481
+ value: 0.11900000000000001
482
+ - type: precision_at_3
483
+ value: 17.1
484
+ - type: precision_at_5
485
+ value: 11.729000000000001
486
+ - type: recall_at_1
487
+ value: 28.807
488
+ - type: recall_at_10
489
+ value: 61.138999999999996
490
+ - type: recall_at_100
491
+ value: 84.491
492
+ - type: recall_at_1000
493
+ value: 95.651
494
+ - type: recall_at_3
495
+ value: 45.652
496
+ - type: recall_at_5
497
+ value: 51.522
498
+ - task:
499
+ type: Retrieval
500
+ dataset:
501
+ type: cqadupstack/mathematica
502
+ name: MTEB CQADupstackMathematicaRetrieval
503
+ config: default
504
+ split: test
505
+ revision: None
506
+ metrics:
507
+ - type: map_at_1
508
+ value: 20.607
509
+ - type: map_at_10
510
+ value: 31.944
511
+ - type: map_at_100
512
+ value: 33.317
513
+ - type: map_at_1000
514
+ value: 33.428000000000004
515
+ - type: map_at_3
516
+ value: 28.508
517
+ - type: map_at_5
518
+ value: 30.348999999999997
519
+ - type: mrr_at_1
520
+ value: 25.622
521
+ - type: mrr_at_10
522
+ value: 36.726
523
+ - type: mrr_at_100
524
+ value: 37.707
525
+ - type: mrr_at_1000
526
+ value: 37.761
527
+ - type: mrr_at_3
528
+ value: 33.934
529
+ - type: mrr_at_5
530
+ value: 35.452
531
+ - type: ndcg_at_1
532
+ value: 25.622
533
+ - type: ndcg_at_10
534
+ value: 38.462
535
+ - type: ndcg_at_100
536
+ value: 44.327
537
+ - type: ndcg_at_1000
538
+ value: 46.623
539
+ - type: ndcg_at_3
540
+ value: 32.583
541
+ - type: ndcg_at_5
542
+ value: 35.175
543
+ - type: precision_at_1
544
+ value: 25.622
545
+ - type: precision_at_10
546
+ value: 7.425
547
+ - type: precision_at_100
548
+ value: 1.173
549
+ - type: precision_at_1000
550
+ value: 0.149
551
+ - type: precision_at_3
552
+ value: 16.418
553
+ - type: precision_at_5
554
+ value: 11.866
555
+ - type: recall_at_1
556
+ value: 20.607
557
+ - type: recall_at_10
558
+ value: 53.337
559
+ - type: recall_at_100
560
+ value: 78.133
561
+ - type: recall_at_1000
562
+ value: 94.151
563
+ - type: recall_at_3
564
+ value: 37.088
565
+ - type: recall_at_5
566
+ value: 43.627
567
+ - task:
568
+ type: Retrieval
569
+ dataset:
570
+ type: cqadupstack/physics
571
+ name: MTEB CQADupstackPhysicsRetrieval
572
+ config: default
573
+ split: test
574
+ revision: None
575
+ metrics:
576
+ - type: map_at_1
577
+ value: 33.814
578
+ - type: map_at_10
579
+ value: 47.609
580
+ - type: map_at_100
581
+ value: 48.972
582
+ - type: map_at_1000
583
+ value: 49.061
584
+ - type: map_at_3
585
+ value: 43.397999999999996
586
+ - type: map_at_5
587
+ value: 45.839
588
+ - type: mrr_at_1
589
+ value: 42.059999999999995
590
+ - type: mrr_at_10
591
+ value: 53.074
592
+ - type: mrr_at_100
593
+ value: 53.76800000000001
594
+ - type: mrr_at_1000
595
+ value: 53.794
596
+ - type: mrr_at_3
597
+ value: 50.241
598
+ - type: mrr_at_5
599
+ value: 51.805
600
+ - type: ndcg_at_1
601
+ value: 42.059999999999995
602
+ - type: ndcg_at_10
603
+ value: 54.419
604
+ - type: ndcg_at_100
605
+ value: 59.508
606
+ - type: ndcg_at_1000
607
+ value: 60.858000000000004
608
+ - type: ndcg_at_3
609
+ value: 48.296
610
+ - type: ndcg_at_5
611
+ value: 51.28
612
+ - type: precision_at_1
613
+ value: 42.059999999999995
614
+ - type: precision_at_10
615
+ value: 10.231
616
+ - type: precision_at_100
617
+ value: 1.4789999999999999
618
+ - type: precision_at_1000
619
+ value: 0.17700000000000002
620
+ - type: precision_at_3
621
+ value: 23.419999999999998
622
+ - type: precision_at_5
623
+ value: 16.843
624
+ - type: recall_at_1
625
+ value: 33.814
626
+ - type: recall_at_10
627
+ value: 68.88
628
+ - type: recall_at_100
629
+ value: 89.794
630
+ - type: recall_at_1000
631
+ value: 98.058
632
+ - type: recall_at_3
633
+ value: 51.915
634
+ - type: recall_at_5
635
+ value: 59.704
636
+ - task:
637
+ type: Retrieval
638
+ dataset:
639
+ type: cqadupstack/programmers
640
+ name: MTEB CQADupstackProgrammersRetrieval
641
+ config: default
642
+ split: test
643
+ revision: None
644
+ metrics:
645
+ - type: map_at_1
646
+ value: 29.668
647
+ - type: map_at_10
648
+ value: 43.032
649
+ - type: map_at_100
650
+ value: 44.48
651
+ - type: map_at_1000
652
+ value: 44.574000000000005
653
+ - type: map_at_3
654
+ value: 38.609
655
+ - type: map_at_5
656
+ value: 41.164
657
+ - type: mrr_at_1
658
+ value: 37.785000000000004
659
+ - type: mrr_at_10
660
+ value: 48.898
661
+ - type: mrr_at_100
662
+ value: 49.728
663
+ - type: mrr_at_1000
664
+ value: 49.769000000000005
665
+ - type: mrr_at_3
666
+ value: 45.909
667
+ - type: mrr_at_5
668
+ value: 47.61
669
+ - type: ndcg_at_1
670
+ value: 37.785000000000004
671
+ - type: ndcg_at_10
672
+ value: 50.21099999999999
673
+ - type: ndcg_at_100
674
+ value: 55.657999999999994
675
+ - type: ndcg_at_1000
676
+ value: 57.172
677
+ - type: ndcg_at_3
678
+ value: 43.726
679
+ - type: ndcg_at_5
680
+ value: 46.758
681
+ - type: precision_at_1
682
+ value: 37.785000000000004
683
+ - type: precision_at_10
684
+ value: 9.669
685
+ - type: precision_at_100
686
+ value: 1.4409999999999998
687
+ - type: precision_at_1000
688
+ value: 0.174
689
+ - type: precision_at_3
690
+ value: 21.651
691
+ - type: precision_at_5
692
+ value: 15.822
693
+ - type: recall_at_1
694
+ value: 29.668
695
+ - type: recall_at_10
696
+ value: 65.575
697
+ - type: recall_at_100
698
+ value: 87.977
699
+ - type: recall_at_1000
700
+ value: 97.615
701
+ - type: recall_at_3
702
+ value: 47.251
703
+ - type: recall_at_5
704
+ value: 55.359
705
+ - task:
706
+ type: Retrieval
707
+ dataset:
708
+ type: mteb/cqadupstack
709
+ name: MTEB CQADupstackRetrieval
710
+ config: default
711
+ split: test
712
+ revision: None
713
+ metrics:
714
+ - type: map_at_1
715
+ value: 30.29925
716
+ - type: map_at_10
717
+ value: 41.98708333333333
718
+ - type: map_at_100
719
+ value: 43.306916666666666
720
+ - type: map_at_1000
721
+ value: 43.40716666666667
722
+ - type: map_at_3
723
+ value: 38.431666666666665
724
+ - type: map_at_5
725
+ value: 40.4195
726
+ - type: mrr_at_1
727
+ value: 36.24483333333334
728
+ - type: mrr_at_10
729
+ value: 46.32666666666667
730
+ - type: mrr_at_100
731
+ value: 47.13983333333333
732
+ - type: mrr_at_1000
733
+ value: 47.18058333333334
734
+ - type: mrr_at_3
735
+ value: 43.66799999999999
736
+ - type: mrr_at_5
737
+ value: 45.163666666666664
738
+ - type: ndcg_at_1
739
+ value: 36.24483333333334
740
+ - type: ndcg_at_10
741
+ value: 48.251916666666666
742
+ - type: ndcg_at_100
743
+ value: 53.3555
744
+ - type: ndcg_at_1000
745
+ value: 55.024249999999995
746
+ - type: ndcg_at_3
747
+ value: 42.599583333333335
748
+ - type: ndcg_at_5
749
+ value: 45.24166666666666
750
+ - type: precision_at_1
751
+ value: 36.24483333333334
752
+ - type: precision_at_10
753
+ value: 8.666833333333333
754
+ - type: precision_at_100
755
+ value: 1.3214166666666665
756
+ - type: precision_at_1000
757
+ value: 0.16475
758
+ - type: precision_at_3
759
+ value: 19.9955
760
+ - type: precision_at_5
761
+ value: 14.271999999999998
762
+ - type: recall_at_1
763
+ value: 30.29925
764
+ - type: recall_at_10
765
+ value: 62.232333333333344
766
+ - type: recall_at_100
767
+ value: 84.151
768
+ - type: recall_at_1000
769
+ value: 95.37333333333333
770
+ - type: recall_at_3
771
+ value: 46.45541666666667
772
+ - type: recall_at_5
773
+ value: 53.264
774
+ - task:
775
+ type: Retrieval
776
+ dataset:
777
+ type: cqadupstack/stats
778
+ name: MTEB CQADupstackStatsRetrieval
779
+ config: default
780
+ split: test
781
+ revision: None
782
+ metrics:
783
+ - type: map_at_1
784
+ value: 28.996
785
+ - type: map_at_10
786
+ value: 38.047
787
+ - type: map_at_100
788
+ value: 39.121
789
+ - type: map_at_1000
790
+ value: 39.202999999999996
791
+ - type: map_at_3
792
+ value: 35.376000000000005
793
+ - type: map_at_5
794
+ value: 36.763
795
+ - type: mrr_at_1
796
+ value: 32.362
797
+ - type: mrr_at_10
798
+ value: 40.717999999999996
799
+ - type: mrr_at_100
800
+ value: 41.586
801
+ - type: mrr_at_1000
802
+ value: 41.641
803
+ - type: mrr_at_3
804
+ value: 38.292
805
+ - type: mrr_at_5
806
+ value: 39.657
807
+ - type: ndcg_at_1
808
+ value: 32.362
809
+ - type: ndcg_at_10
810
+ value: 43.105
811
+ - type: ndcg_at_100
812
+ value: 48.026
813
+ - type: ndcg_at_1000
814
+ value: 49.998
815
+ - type: ndcg_at_3
816
+ value: 38.147999999999996
817
+ - type: ndcg_at_5
818
+ value: 40.385
819
+ - type: precision_at_1
820
+ value: 32.362
821
+ - type: precision_at_10
822
+ value: 6.7940000000000005
823
+ - type: precision_at_100
824
+ value: 1.0170000000000001
825
+ - type: precision_at_1000
826
+ value: 0.125
827
+ - type: precision_at_3
828
+ value: 16.411
829
+ - type: precision_at_5
830
+ value: 11.35
831
+ - type: recall_at_1
832
+ value: 28.996
833
+ - type: recall_at_10
834
+ value: 55.955
835
+ - type: recall_at_100
836
+ value: 77.744
837
+ - type: recall_at_1000
838
+ value: 92.196
839
+ - type: recall_at_3
840
+ value: 42.254999999999995
841
+ - type: recall_at_5
842
+ value: 47.776
843
+ - task:
844
+ type: Retrieval
845
+ dataset:
846
+ type: cqadupstack/tex
847
+ name: MTEB CQADupstackTexRetrieval
848
+ config: default
849
+ split: test
850
+ revision: None
851
+ metrics:
852
+ - type: map_at_1
853
+ value: 20.029
854
+ - type: map_at_10
855
+ value: 29.188
856
+ - type: map_at_100
857
+ value: 30.484
858
+ - type: map_at_1000
859
+ value: 30.608
860
+ - type: map_at_3
861
+ value: 26.195
862
+ - type: map_at_5
863
+ value: 27.866999999999997
864
+ - type: mrr_at_1
865
+ value: 24.57
866
+ - type: mrr_at_10
867
+ value: 33.461
868
+ - type: mrr_at_100
869
+ value: 34.398
870
+ - type: mrr_at_1000
871
+ value: 34.464
872
+ - type: mrr_at_3
873
+ value: 30.856
874
+ - type: mrr_at_5
875
+ value: 32.322
876
+ - type: ndcg_at_1
877
+ value: 24.57
878
+ - type: ndcg_at_10
879
+ value: 34.846
880
+ - type: ndcg_at_100
881
+ value: 40.544000000000004
882
+ - type: ndcg_at_1000
883
+ value: 43.019
884
+ - type: ndcg_at_3
885
+ value: 29.683999999999997
886
+ - type: ndcg_at_5
887
+ value: 32.11
888
+ - type: precision_at_1
889
+ value: 24.57
890
+ - type: precision_at_10
891
+ value: 6.535
892
+ - type: precision_at_100
893
+ value: 1.11
894
+ - type: precision_at_1000
895
+ value: 0.149
896
+ - type: precision_at_3
897
+ value: 14.338000000000001
898
+ - type: precision_at_5
899
+ value: 10.496
900
+ - type: recall_at_1
901
+ value: 20.029
902
+ - type: recall_at_10
903
+ value: 47.509
904
+ - type: recall_at_100
905
+ value: 72.61999999999999
906
+ - type: recall_at_1000
907
+ value: 89.778
908
+ - type: recall_at_3
909
+ value: 33.031
910
+ - type: recall_at_5
911
+ value: 39.306000000000004
912
+ - task:
913
+ type: Retrieval
914
+ dataset:
915
+ type: cqadupstack/unix
916
+ name: MTEB CQADupstackUnixRetrieval
917
+ config: default
918
+ split: test
919
+ revision: None
920
+ metrics:
921
+ - type: map_at_1
922
+ value: 31.753999999999998
923
+ - type: map_at_10
924
+ value: 43.814
925
+ - type: map_at_100
926
+ value: 45.072
927
+ - type: map_at_1000
928
+ value: 45.155
929
+ - type: map_at_3
930
+ value: 40.316
931
+ - type: map_at_5
932
+ value: 42.15
933
+ - type: mrr_at_1
934
+ value: 38.06
935
+ - type: mrr_at_10
936
+ value: 48.311
937
+ - type: mrr_at_100
938
+ value: 49.145
939
+ - type: mrr_at_1000
940
+ value: 49.181000000000004
941
+ - type: mrr_at_3
942
+ value: 45.678000000000004
943
+ - type: mrr_at_5
944
+ value: 47.072
945
+ - type: ndcg_at_1
946
+ value: 38.06
947
+ - type: ndcg_at_10
948
+ value: 50.083
949
+ - type: ndcg_at_100
950
+ value: 55.342
951
+ - type: ndcg_at_1000
952
+ value: 56.87
953
+ - type: ndcg_at_3
954
+ value: 44.513999999999996
955
+ - type: ndcg_at_5
956
+ value: 46.886
957
+ - type: precision_at_1
958
+ value: 38.06
959
+ - type: precision_at_10
960
+ value: 8.638
961
+ - type: precision_at_100
962
+ value: 1.253
963
+ - type: precision_at_1000
964
+ value: 0.149
965
+ - type: precision_at_3
966
+ value: 20.709
967
+ - type: precision_at_5
968
+ value: 14.44
969
+ - type: recall_at_1
970
+ value: 31.753999999999998
971
+ - type: recall_at_10
972
+ value: 64.473
973
+ - type: recall_at_100
974
+ value: 86.832
975
+ - type: recall_at_1000
976
+ value: 96.706
977
+ - type: recall_at_3
978
+ value: 48.937000000000005
979
+ - type: recall_at_5
980
+ value: 55.214
981
+ - task:
982
+ type: Retrieval
983
+ dataset:
984
+ type: cqadupstack/webmasters
985
+ name: MTEB CQADupstackWebmastersRetrieval
986
+ config: default
987
+ split: test
988
+ revision: None
989
+ metrics:
990
+ - type: map_at_1
991
+ value: 28.815
992
+ - type: map_at_10
993
+ value: 40.595
994
+ - type: map_at_100
995
+ value: 42.337
996
+ - type: map_at_1000
997
+ value: 42.559000000000005
998
+ - type: map_at_3
999
+ value: 37.120999999999995
1000
+ - type: map_at_5
1001
+ value: 38.912
1002
+ - type: mrr_at_1
1003
+ value: 34.585
1004
+ - type: mrr_at_10
1005
+ value: 45.068000000000005
1006
+ - type: mrr_at_100
1007
+ value: 45.93
1008
+ - type: mrr_at_1000
1009
+ value: 45.974
1010
+ - type: mrr_at_3
1011
+ value: 42.26
1012
+ - type: mrr_at_5
1013
+ value: 43.742
1014
+ - type: ndcg_at_1
1015
+ value: 34.585
1016
+ - type: ndcg_at_10
1017
+ value: 47.519
1018
+ - type: ndcg_at_100
1019
+ value: 53.102000000000004
1020
+ - type: ndcg_at_1000
1021
+ value: 54.949999999999996
1022
+ - type: ndcg_at_3
1023
+ value: 41.719
1024
+ - type: ndcg_at_5
1025
+ value: 44.17
1026
+ - type: precision_at_1
1027
+ value: 34.585
1028
+ - type: precision_at_10
1029
+ value: 9.368
1030
+ - type: precision_at_100
1031
+ value: 1.7870000000000001
1032
+ - type: precision_at_1000
1033
+ value: 0.254
1034
+ - type: precision_at_3
1035
+ value: 19.895
1036
+ - type: precision_at_5
1037
+ value: 14.506
1038
+ - type: recall_at_1
1039
+ value: 28.815
1040
+ - type: recall_at_10
1041
+ value: 61.414
1042
+ - type: recall_at_100
1043
+ value: 85.922
1044
+ - type: recall_at_1000
1045
+ value: 97.15
1046
+ - type: recall_at_3
1047
+ value: 45.076
1048
+ - type: recall_at_5
1049
+ value: 51.271
1050
+ - task:
1051
+ type: Retrieval
1052
+ dataset:
1053
+ type: cqadupstack/wordpress
1054
+ name: MTEB CQADupstackWordpressRetrieval
1055
+ config: default
1056
+ split: test
1057
+ revision: None
1058
+ metrics:
1059
+ - type: map_at_1
1060
+ value: 24.298000000000002
1061
+ - type: map_at_10
1062
+ value: 32.889
1063
+ - type: map_at_100
1064
+ value: 33.989999999999995
1065
+ - type: map_at_1000
1066
+ value: 34.074
1067
+ - type: map_at_3
1068
+ value: 29.873
1069
+ - type: map_at_5
1070
+ value: 31.539
1071
+ - type: mrr_at_1
1072
+ value: 26.433
1073
+ - type: mrr_at_10
1074
+ value: 34.937000000000005
1075
+ - type: mrr_at_100
1076
+ value: 35.914
1077
+ - type: mrr_at_1000
1078
+ value: 35.96
1079
+ - type: mrr_at_3
1080
+ value: 32.286
1081
+ - type: mrr_at_5
1082
+ value: 33.663
1083
+ - type: ndcg_at_1
1084
+ value: 26.433
1085
+ - type: ndcg_at_10
1086
+ value: 38.173
1087
+ - type: ndcg_at_100
1088
+ value: 43.884
1089
+ - type: ndcg_at_1000
1090
+ value: 45.916000000000004
1091
+ - type: ndcg_at_3
1092
+ value: 32.419
1093
+ - type: ndcg_at_5
1094
+ value: 35.092
1095
+ - type: precision_at_1
1096
+ value: 26.433
1097
+ - type: precision_at_10
1098
+ value: 6.1
1099
+ - type: precision_at_100
1100
+ value: 0.963
1101
+ - type: precision_at_1000
1102
+ value: 0.126
1103
+ - type: precision_at_3
1104
+ value: 13.802
1105
+ - type: precision_at_5
1106
+ value: 9.871
1107
+ - type: recall_at_1
1108
+ value: 24.298000000000002
1109
+ - type: recall_at_10
1110
+ value: 52.554
1111
+ - type: recall_at_100
1112
+ value: 79.345
1113
+ - type: recall_at_1000
1114
+ value: 94.464
1115
+ - type: recall_at_3
1116
+ value: 37.036
1117
+ - type: recall_at_5
1118
+ value: 43.518
1119
+ - task:
1120
+ type: Retrieval
1121
+ dataset:
1122
+ type: climate-fever
1123
+ name: MTEB ClimateFEVER
1124
+ config: default
1125
+ split: test
1126
+ revision: None
1127
+ metrics:
1128
+ - type: map_at_1
1129
+ value: 14.194999999999999
1130
+ - type: map_at_10
1131
+ value: 24.563
1132
+ - type: map_at_100
1133
+ value: 26.775
1134
+ - type: map_at_1000
1135
+ value: 26.965
1136
+ - type: map_at_3
1137
+ value: 19.983999999999998
1138
+ - type: map_at_5
1139
+ value: 22.24
1140
+ - type: mrr_at_1
1141
+ value: 31.661
1142
+ - type: mrr_at_10
1143
+ value: 44.804
1144
+ - type: mrr_at_100
1145
+ value: 45.655
1146
+ - type: mrr_at_1000
1147
+ value: 45.678000000000004
1148
+ - type: mrr_at_3
1149
+ value: 41.292
1150
+ - type: mrr_at_5
1151
+ value: 43.468
1152
+ - type: ndcg_at_1
1153
+ value: 31.661
1154
+ - type: ndcg_at_10
1155
+ value: 34.271
1156
+ - type: ndcg_at_100
1157
+ value: 42.04
1158
+ - type: ndcg_at_1000
1159
+ value: 45.101
1160
+ - type: ndcg_at_3
1161
+ value: 27.529999999999998
1162
+ - type: ndcg_at_5
1163
+ value: 29.862
1164
+ - type: precision_at_1
1165
+ value: 31.661
1166
+ - type: precision_at_10
1167
+ value: 10.925
1168
+ - type: precision_at_100
1169
+ value: 1.92
1170
+ - type: precision_at_1000
1171
+ value: 0.25
1172
+ - type: precision_at_3
1173
+ value: 20.456
1174
+ - type: precision_at_5
1175
+ value: 16.012999999999998
1176
+ - type: recall_at_1
1177
+ value: 14.194999999999999
1178
+ - type: recall_at_10
1179
+ value: 41.388999999999996
1180
+ - type: recall_at_100
1181
+ value: 67.58800000000001
1182
+ - type: recall_at_1000
1183
+ value: 84.283
1184
+ - type: recall_at_3
1185
+ value: 25.089
1186
+ - type: recall_at_5
1187
+ value: 31.642
1188
+ - task:
1189
+ type: Retrieval
1190
+ dataset:
1191
+ type: dbpedia-entity
1192
+ name: MTEB DBPedia
1193
+ config: default
1194
+ split: test
1195
+ revision: None
1196
+ metrics:
1197
+ - type: map_at_1
1198
+ value: 9.898
1199
+ - type: map_at_10
1200
+ value: 23.226
1201
+ - type: map_at_100
1202
+ value: 33.372
1203
+ - type: map_at_1000
1204
+ value: 35.407
1205
+ - type: map_at_3
1206
+ value: 15.892999999999999
1207
+ - type: map_at_5
1208
+ value: 18.747
1209
+ - type: mrr_at_1
1210
+ value: 73.5
1211
+ - type: mrr_at_10
1212
+ value: 80.404
1213
+ - type: mrr_at_100
1214
+ value: 80.671
1215
+ - type: mrr_at_1000
1216
+ value: 80.676
1217
+ - type: mrr_at_3
1218
+ value: 78.958
1219
+ - type: mrr_at_5
1220
+ value: 79.683
1221
+ - type: ndcg_at_1
1222
+ value: 62.0
1223
+ - type: ndcg_at_10
1224
+ value: 48.337
1225
+ - type: ndcg_at_100
1226
+ value: 53.474
1227
+ - type: ndcg_at_1000
1228
+ value: 60.999
1229
+ - type: ndcg_at_3
1230
+ value: 52.538
1231
+ - type: ndcg_at_5
1232
+ value: 49.659
1233
+ - type: precision_at_1
1234
+ value: 73.5
1235
+ - type: precision_at_10
1236
+ value: 39.25
1237
+ - type: precision_at_100
1238
+ value: 12.4
1239
+ - type: precision_at_1000
1240
+ value: 2.4459999999999997
1241
+ - type: precision_at_3
1242
+ value: 56.333
1243
+ - type: precision_at_5
1244
+ value: 48.15
1245
+ - type: recall_at_1
1246
+ value: 9.898
1247
+ - type: recall_at_10
1248
+ value: 29.511
1249
+ - type: recall_at_100
1250
+ value: 60.45700000000001
1251
+ - type: recall_at_1000
1252
+ value: 84.47200000000001
1253
+ - type: recall_at_3
1254
+ value: 17.064
1255
+ - type: recall_at_5
1256
+ value: 21.258
1257
+ - task:
1258
+ type: Classification
1259
+ dataset:
1260
+ type: mteb/emotion
1261
+ name: MTEB EmotionClassification
1262
+ config: default
1263
+ split: test
1264
+ revision: 4f58c6b202a23cf9a4da393831edf4f9183cad37
1265
+ metrics:
1266
+ - type: accuracy
1267
+ value: 51.19999999999999
1268
+ - type: f1
1269
+ value: 46.23854137552949
1270
+ - task:
1271
+ type: Retrieval
1272
+ dataset:
1273
+ type: fever
1274
+ name: MTEB FEVER
1275
+ config: default
1276
+ split: test
1277
+ revision: None
1278
+ metrics:
1279
+ - type: map_at_1
1280
+ value: 80.093
1281
+ - type: map_at_10
1282
+ value: 87.139
1283
+ - type: map_at_100
1284
+ value: 87.333
1285
+ - type: map_at_1000
1286
+ value: 87.344
1287
+ - type: map_at_3
1288
+ value: 86.395
1289
+ - type: map_at_5
1290
+ value: 86.866
1291
+ - type: mrr_at_1
1292
+ value: 86.36399999999999
1293
+ - type: mrr_at_10
1294
+ value: 91.867
1295
+ - type: mrr_at_100
1296
+ value: 91.906
1297
+ - type: mrr_at_1000
1298
+ value: 91.90700000000001
1299
+ - type: mrr_at_3
1300
+ value: 91.484
1301
+ - type: mrr_at_5
1302
+ value: 91.759
1303
+ - type: ndcg_at_1
1304
+ value: 86.36399999999999
1305
+ - type: ndcg_at_10
1306
+ value: 90.197
1307
+ - type: ndcg_at_100
1308
+ value: 90.819
1309
+ - type: ndcg_at_1000
1310
+ value: 91.01599999999999
1311
+ - type: ndcg_at_3
1312
+ value: 89.166
1313
+ - type: ndcg_at_5
1314
+ value: 89.74
1315
+ - type: precision_at_1
1316
+ value: 86.36399999999999
1317
+ - type: precision_at_10
1318
+ value: 10.537
1319
+ - type: precision_at_100
1320
+ value: 1.106
1321
+ - type: precision_at_1000
1322
+ value: 0.11399999999999999
1323
+ - type: precision_at_3
1324
+ value: 33.608
1325
+ - type: precision_at_5
1326
+ value: 20.618
1327
+ - type: recall_at_1
1328
+ value: 80.093
1329
+ - type: recall_at_10
1330
+ value: 95.003
1331
+ - type: recall_at_100
1332
+ value: 97.328
1333
+ - type: recall_at_1000
1334
+ value: 98.485
1335
+ - type: recall_at_3
1336
+ value: 92.072
1337
+ - type: recall_at_5
1338
+ value: 93.661
1339
+ - task:
1340
+ type: Retrieval
1341
+ dataset:
1342
+ type: fiqa
1343
+ name: MTEB FiQA2018
1344
+ config: default
1345
+ split: test
1346
+ revision: None
1347
+ metrics:
1348
+ - type: map_at_1
1349
+ value: 29.063
1350
+ - type: map_at_10
1351
+ value: 47.113
1352
+ - type: map_at_100
1353
+ value: 49.294
1354
+ - type: map_at_1000
1355
+ value: 49.422
1356
+ - type: map_at_3
1357
+ value: 40.955000000000005
1358
+ - type: map_at_5
1359
+ value: 44.5
1360
+ - type: mrr_at_1
1361
+ value: 55.401
1362
+ - type: mrr_at_10
1363
+ value: 62.99400000000001
1364
+ - type: mrr_at_100
1365
+ value: 63.63999999999999
1366
+ - type: mrr_at_1000
1367
+ value: 63.661
1368
+ - type: mrr_at_3
1369
+ value: 61.034
1370
+ - type: mrr_at_5
1371
+ value: 62.253
1372
+ - type: ndcg_at_1
1373
+ value: 55.401
1374
+ - type: ndcg_at_10
1375
+ value: 55.332
1376
+ - type: ndcg_at_100
1377
+ value: 61.931000000000004
1378
+ - type: ndcg_at_1000
1379
+ value: 63.841
1380
+ - type: ndcg_at_3
1381
+ value: 50.92
1382
+ - type: ndcg_at_5
1383
+ value: 52.525
1384
+ - type: precision_at_1
1385
+ value: 55.401
1386
+ - type: precision_at_10
1387
+ value: 15.262
1388
+ - type: precision_at_100
1389
+ value: 2.231
1390
+ - type: precision_at_1000
1391
+ value: 0.256
1392
+ - type: precision_at_3
1393
+ value: 33.848
1394
+ - type: precision_at_5
1395
+ value: 25.031
1396
+ - type: recall_at_1
1397
+ value: 29.063
1398
+ - type: recall_at_10
1399
+ value: 62.498
1400
+ - type: recall_at_100
1401
+ value: 85.86
1402
+ - type: recall_at_1000
1403
+ value: 97.409
1404
+ - type: recall_at_3
1405
+ value: 45.472
1406
+ - type: recall_at_5
1407
+ value: 53.344
1408
+ - task:
1409
+ type: Retrieval
1410
+ dataset:
1411
+ type: hotpotqa
1412
+ name: MTEB HotpotQA
1413
+ config: default
1414
+ split: test
1415
+ revision: None
1416
+ metrics:
1417
+ - type: map_at_1
1418
+ value: 37.205
1419
+ - type: map_at_10
1420
+ value: 64.19399999999999
1421
+ - type: map_at_100
1422
+ value: 65.183
1423
+ - type: map_at_1000
1424
+ value: 65.23299999999999
1425
+ - type: map_at_3
1426
+ value: 60.239
1427
+ - type: map_at_5
1428
+ value: 62.695
1429
+ - type: mrr_at_1
1430
+ value: 74.409
1431
+ - type: mrr_at_10
1432
+ value: 80.84
1433
+ - type: mrr_at_100
1434
+ value: 81.10199999999999
1435
+ - type: mrr_at_1000
1436
+ value: 81.109
1437
+ - type: mrr_at_3
1438
+ value: 79.739
1439
+ - type: mrr_at_5
1440
+ value: 80.46600000000001
1441
+ - type: ndcg_at_1
1442
+ value: 74.409
1443
+ - type: ndcg_at_10
1444
+ value: 71.757
1445
+ - type: ndcg_at_100
1446
+ value: 75.152
1447
+ - type: ndcg_at_1000
1448
+ value: 76.098
1449
+ - type: ndcg_at_3
1450
+ value: 66.174
1451
+ - type: ndcg_at_5
1452
+ value: 69.283
1453
+ - type: precision_at_1
1454
+ value: 74.409
1455
+ - type: precision_at_10
1456
+ value: 15.503
1457
+ - type: precision_at_100
1458
+ value: 1.8110000000000002
1459
+ - type: precision_at_1000
1460
+ value: 0.194
1461
+ - type: precision_at_3
1462
+ value: 43.457
1463
+ - type: precision_at_5
1464
+ value: 28.532000000000004
1465
+ - type: recall_at_1
1466
+ value: 37.205
1467
+ - type: recall_at_10
1468
+ value: 77.515
1469
+ - type: recall_at_100
1470
+ value: 90.56
1471
+ - type: recall_at_1000
1472
+ value: 96.759
1473
+ - type: recall_at_3
1474
+ value: 65.18599999999999
1475
+ - type: recall_at_5
1476
+ value: 71.33
1477
+ - task:
1478
+ type: Classification
1479
+ dataset:
1480
+ type: mteb/imdb
1481
+ name: MTEB ImdbClassification
1482
+ config: default
1483
+ split: test
1484
+ revision: 3d86128a09e091d6018b6d26cad27f2739fc2db7
1485
+ metrics:
1486
+ - type: accuracy
1487
+ value: 82.9448
1488
+ - type: ap
1489
+ value: 78.25923353099166
1490
+ - type: f1
1491
+ value: 82.86422040179993
1492
+ - task:
1493
+ type: Retrieval
1494
+ dataset:
1495
+ type: msmarco
1496
+ name: MTEB MSMARCO
1497
+ config: default
1498
+ split: dev
1499
+ revision: None
1500
+ metrics:
1501
+ - type: map_at_1
1502
+ value: 22.834
1503
+ - type: map_at_10
1504
+ value: 35.85
1505
+ - type: map_at_100
1506
+ value: 37.013
1507
+ - type: map_at_1000
1508
+ value: 37.056
1509
+ - type: map_at_3
1510
+ value: 31.613000000000003
1511
+ - type: map_at_5
1512
+ value: 34.113
1513
+ - type: mrr_at_1
1514
+ value: 23.424
1515
+ - type: mrr_at_10
1516
+ value: 36.398
1517
+ - type: mrr_at_100
1518
+ value: 37.498
1519
+ - type: mrr_at_1000
1520
+ value: 37.534
1521
+ - type: mrr_at_3
1522
+ value: 32.275999999999996
1523
+ - type: mrr_at_5
1524
+ value: 34.705000000000005
1525
+ - type: ndcg_at_1
1526
+ value: 23.424
1527
+ - type: ndcg_at_10
1528
+ value: 43.236999999999995
1529
+ - type: ndcg_at_100
1530
+ value: 48.776
1531
+ - type: ndcg_at_1000
1532
+ value: 49.778
1533
+ - type: ndcg_at_3
1534
+ value: 34.692
1535
+ - type: ndcg_at_5
1536
+ value: 39.119
1537
+ - type: precision_at_1
1538
+ value: 23.424
1539
+ - type: precision_at_10
1540
+ value: 6.918
1541
+ - type: precision_at_100
1542
+ value: 0.9690000000000001
1543
+ - type: precision_at_1000
1544
+ value: 0.105
1545
+ - type: precision_at_3
1546
+ value: 14.881
1547
+ - type: precision_at_5
1548
+ value: 11.183
1549
+ - type: recall_at_1
1550
+ value: 22.834
1551
+ - type: recall_at_10
1552
+ value: 66.03999999999999
1553
+ - type: recall_at_100
1554
+ value: 91.532
1555
+ - type: recall_at_1000
1556
+ value: 99.068
1557
+ - type: recall_at_3
1558
+ value: 42.936
1559
+ - type: recall_at_5
1560
+ value: 53.539
1561
+ - task:
1562
+ type: Classification
1563
+ dataset:
1564
+ type: mteb/mtop_domain
1565
+ name: MTEB MTOPDomainClassification (en)
1566
+ config: en
1567
+ split: test
1568
+ revision: d80d48c1eb48d3562165c59d59d0034df9fff0bf
1569
+ metrics:
1570
+ - type: accuracy
1571
+ value: 96.1377108983128
1572
+ - type: f1
1573
+ value: 95.87034720246666
1574
+ - task:
1575
+ type: Classification
1576
+ dataset:
1577
+ type: mteb/mtop_intent
1578
+ name: MTEB MTOPIntentClassification (en)
1579
+ config: en
1580
+ split: test
1581
+ revision: ae001d0e6b1228650b7bd1c2c65fb50ad11a8aba
1582
+ metrics:
1583
+ - type: accuracy
1584
+ value: 86.10579115367078
1585
+ - type: f1
1586
+ value: 70.20810321445228
1587
+ - task:
1588
+ type: Classification
1589
+ dataset:
1590
+ type: mteb/amazon_massive_intent
1591
+ name: MTEB MassiveIntentClassification (en)
1592
+ config: en
1593
+ split: test
1594
+ revision: 31efe3c427b0bae9c22cbb560b8f15491cc6bed7
1595
+ metrics:
1596
+ - type: accuracy
1597
+ value: 79.80497646267652
1598
+ - type: f1
1599
+ value: 77.32475274059293
1600
+ - task:
1601
+ type: Classification
1602
+ dataset:
1603
+ type: mteb/amazon_massive_scenario
1604
+ name: MTEB MassiveScenarioClassification (en)
1605
+ config: en
1606
+ split: test
1607
+ revision: 7d571f92784cd94a019292a1f45445077d0ef634
1608
+ metrics:
1609
+ - type: accuracy
1610
+ value: 81.52320107599192
1611
+ - type: f1
1612
+ value: 81.22312939311655
1613
+ - task:
1614
+ type: Clustering
1615
+ dataset:
1616
+ type: mteb/medrxiv-clustering-p2p
1617
+ name: MTEB MedrxivClusteringP2P
1618
+ config: default
1619
+ split: test
1620
+ revision: e7a26af6f3ae46b30dde8737f02c07b1505bcc73
1621
+ metrics:
1622
+ - type: v_measure
1623
+ value: 30.709106678767018
1624
+ - task:
1625
+ type: Clustering
1626
+ dataset:
1627
+ type: mteb/medrxiv-clustering-s2s
1628
+ name: MTEB MedrxivClusteringS2S
1629
+ config: default
1630
+ split: test
1631
+ revision: 35191c8c0dca72d8ff3efcd72aa802307d469663
1632
+ metrics:
1633
+ - type: v_measure
1634
+ value: 32.95879128399585
1635
+ - task:
1636
+ type: Reranking
1637
+ dataset:
1638
+ type: mteb/mind_small
1639
+ name: MTEB MindSmallReranking
1640
+ config: default
1641
+ split: test
1642
+ revision: 3bdac13927fdc888b903db93b2ffdbd90b295a69
1643
+ metrics:
1644
+ - type: map
1645
+ value: 32.67476691128679
1646
+ - type: mrr
1647
+ value: 33.921654478513986
1648
+ - task:
1649
+ type: Retrieval
1650
+ dataset:
1651
+ type: nfcorpus
1652
+ name: MTEB NFCorpus
1653
+ config: default
1654
+ split: test
1655
+ revision: None
1656
+ metrics:
1657
+ - type: map_at_1
1658
+ value: 7.223
1659
+ - type: map_at_10
1660
+ value: 15.992999999999999
1661
+ - type: map_at_100
1662
+ value: 21.09
1663
+ - type: map_at_1000
1664
+ value: 22.822
1665
+ - type: map_at_3
1666
+ value: 11.475
1667
+ - type: map_at_5
1668
+ value: 13.501
1669
+ - type: mrr_at_1
1670
+ value: 53.251000000000005
1671
+ - type: mrr_at_10
1672
+ value: 61.878
1673
+ - type: mrr_at_100
1674
+ value: 62.307
1675
+ - type: mrr_at_1000
1676
+ value: 62.342
1677
+ - type: mrr_at_3
1678
+ value: 60.01
1679
+ - type: mrr_at_5
1680
+ value: 61.202
1681
+ - type: ndcg_at_1
1682
+ value: 51.702999999999996
1683
+ - type: ndcg_at_10
1684
+ value: 41.833999999999996
1685
+ - type: ndcg_at_100
1686
+ value: 39.061
1687
+ - type: ndcg_at_1000
1688
+ value: 47.397
1689
+ - type: ndcg_at_3
1690
+ value: 47.083000000000006
1691
+ - type: ndcg_at_5
1692
+ value: 44.722
1693
+ - type: precision_at_1
1694
+ value: 53.251000000000005
1695
+ - type: precision_at_10
1696
+ value: 31.3
1697
+ - type: precision_at_100
1698
+ value: 10.254000000000001
1699
+ - type: precision_at_1000
1700
+ value: 2.338
1701
+ - type: precision_at_3
1702
+ value: 43.756
1703
+ - type: precision_at_5
1704
+ value: 38.824
1705
+ - type: recall_at_1
1706
+ value: 7.223
1707
+ - type: recall_at_10
1708
+ value: 20.529
1709
+ - type: recall_at_100
1710
+ value: 39.818
1711
+ - type: recall_at_1000
1712
+ value: 70.152
1713
+ - type: recall_at_3
1714
+ value: 12.666
1715
+ - type: recall_at_5
1716
+ value: 15.798000000000002
1717
+ - task:
1718
+ type: Retrieval
1719
+ dataset:
1720
+ type: nq
1721
+ name: MTEB NQ
1722
+ config: default
1723
+ split: test
1724
+ revision: None
1725
+ metrics:
1726
+ - type: map_at_1
1727
+ value: 38.847
1728
+ - type: map_at_10
1729
+ value: 56.255
1730
+ - type: map_at_100
1731
+ value: 57.019
1732
+ - type: map_at_1000
1733
+ value: 57.03
1734
+ - type: map_at_3
1735
+ value: 51.665000000000006
1736
+ - type: map_at_5
1737
+ value: 54.543
1738
+ - type: mrr_at_1
1739
+ value: 43.801
1740
+ - type: mrr_at_10
1741
+ value: 58.733999999999995
1742
+ - type: mrr_at_100
1743
+ value: 59.206
1744
+ - type: mrr_at_1000
1745
+ value: 59.21300000000001
1746
+ - type: mrr_at_3
1747
+ value: 55.266999999999996
1748
+ - type: mrr_at_5
1749
+ value: 57.449
1750
+ - type: ndcg_at_1
1751
+ value: 43.772
1752
+ - type: ndcg_at_10
1753
+ value: 64.213
1754
+ - type: ndcg_at_100
1755
+ value: 67.13
1756
+ - type: ndcg_at_1000
1757
+ value: 67.368
1758
+ - type: ndcg_at_3
1759
+ value: 55.977
1760
+ - type: ndcg_at_5
1761
+ value: 60.597
1762
+ - type: precision_at_1
1763
+ value: 43.772
1764
+ - type: precision_at_10
1765
+ value: 10.272
1766
+ - type: precision_at_100
1767
+ value: 1.193
1768
+ - type: precision_at_1000
1769
+ value: 0.121
1770
+ - type: precision_at_3
1771
+ value: 25.261
1772
+ - type: precision_at_5
1773
+ value: 17.885
1774
+ - type: recall_at_1
1775
+ value: 38.847
1776
+ - type: recall_at_10
1777
+ value: 85.76700000000001
1778
+ - type: recall_at_100
1779
+ value: 98.054
1780
+ - type: recall_at_1000
1781
+ value: 99.812
1782
+ - type: recall_at_3
1783
+ value: 64.82
1784
+ - type: recall_at_5
1785
+ value: 75.381
1786
+ - task:
1787
+ type: Retrieval
1788
+ dataset:
1789
+ type: quora
1790
+ name: MTEB QuoraRetrieval
1791
+ config: default
1792
+ split: test
1793
+ revision: None
1794
+ metrics:
1795
+ - type: map_at_1
1796
+ value: 68.77
1797
+ - type: map_at_10
1798
+ value: 83.195
1799
+ - type: map_at_100
1800
+ value: 83.869
1801
+ - type: map_at_1000
1802
+ value: 83.883
1803
+ - type: map_at_3
1804
+ value: 80.04599999999999
1805
+ - type: map_at_5
1806
+ value: 82.011
1807
+ - type: mrr_at_1
1808
+ value: 79.2
1809
+ - type: mrr_at_10
1810
+ value: 85.942
1811
+ - type: mrr_at_100
1812
+ value: 86.063
1813
+ - type: mrr_at_1000
1814
+ value: 86.064
1815
+ - type: mrr_at_3
1816
+ value: 84.82
1817
+ - type: mrr_at_5
1818
+ value: 85.56899999999999
1819
+ - type: ndcg_at_1
1820
+ value: 79.17999999999999
1821
+ - type: ndcg_at_10
1822
+ value: 87.161
1823
+ - type: ndcg_at_100
1824
+ value: 88.465
1825
+ - type: ndcg_at_1000
1826
+ value: 88.553
1827
+ - type: ndcg_at_3
1828
+ value: 83.958
1829
+ - type: ndcg_at_5
1830
+ value: 85.699
1831
+ - type: precision_at_1
1832
+ value: 79.17999999999999
1833
+ - type: precision_at_10
1834
+ value: 13.401
1835
+ - type: precision_at_100
1836
+ value: 1.54
1837
+ - type: precision_at_1000
1838
+ value: 0.157
1839
+ - type: precision_at_3
1840
+ value: 36.903000000000006
1841
+ - type: precision_at_5
1842
+ value: 24.404
1843
+ - type: recall_at_1
1844
+ value: 68.77
1845
+ - type: recall_at_10
1846
+ value: 95.132
1847
+ - type: recall_at_100
1848
+ value: 99.58200000000001
1849
+ - type: recall_at_1000
1850
+ value: 99.997
1851
+ - type: recall_at_3
1852
+ value: 86.119
1853
+ - type: recall_at_5
1854
+ value: 90.932
1855
+ - task:
1856
+ type: Clustering
1857
+ dataset:
1858
+ type: mteb/reddit-clustering
1859
+ name: MTEB RedditClustering
1860
+ config: default
1861
+ split: test
1862
+ revision: 24640382cdbf8abc73003fb0fa6d111a705499eb
1863
+ metrics:
1864
+ - type: v_measure
1865
+ value: 61.7204049654583
1866
+ - task:
1867
+ type: Clustering
1868
+ dataset:
1869
+ type: mteb/reddit-clustering-p2p
1870
+ name: MTEB RedditClusteringP2P
1871
+ config: default
1872
+ split: test
1873
+ revision: 282350215ef01743dc01b456c7f5241fa8937f16
1874
+ metrics:
1875
+ - type: v_measure
1876
+ value: 63.98164986883849
1877
+ - task:
1878
+ type: Retrieval
1879
+ dataset:
1880
+ type: scidocs
1881
+ name: MTEB SCIDOCS
1882
+ config: default
1883
+ split: test
1884
+ revision: None
1885
+ metrics:
1886
+ - type: map_at_1
1887
+ value: 5.443
1888
+ - type: map_at_10
1889
+ value: 13.86
1890
+ - type: map_at_100
1891
+ value: 16.496
1892
+ - type: map_at_1000
1893
+ value: 16.836000000000002
1894
+ - type: map_at_3
1895
+ value: 9.661
1896
+ - type: map_at_5
1897
+ value: 11.745
1898
+ - type: mrr_at_1
1899
+ value: 26.8
1900
+ - type: mrr_at_10
1901
+ value: 37.777
1902
+ - type: mrr_at_100
1903
+ value: 38.928000000000004
1904
+ - type: mrr_at_1000
1905
+ value: 38.967
1906
+ - type: mrr_at_3
1907
+ value: 34.083000000000006
1908
+ - type: mrr_at_5
1909
+ value: 36.308
1910
+ - type: ndcg_at_1
1911
+ value: 26.8
1912
+ - type: ndcg_at_10
1913
+ value: 22.961000000000002
1914
+ - type: ndcg_at_100
1915
+ value: 32.582
1916
+ - type: ndcg_at_1000
1917
+ value: 37.972
1918
+ - type: ndcg_at_3
1919
+ value: 21.292
1920
+ - type: ndcg_at_5
1921
+ value: 18.945999999999998
1922
+ - type: precision_at_1
1923
+ value: 26.8
1924
+ - type: precision_at_10
1925
+ value: 12.06
1926
+ - type: precision_at_100
1927
+ value: 2.593
1928
+ - type: precision_at_1000
1929
+ value: 0.388
1930
+ - type: precision_at_3
1931
+ value: 19.900000000000002
1932
+ - type: precision_at_5
1933
+ value: 16.84
1934
+ - type: recall_at_1
1935
+ value: 5.443
1936
+ - type: recall_at_10
1937
+ value: 24.445
1938
+ - type: recall_at_100
1939
+ value: 52.602000000000004
1940
+ - type: recall_at_1000
1941
+ value: 78.767
1942
+ - type: recall_at_3
1943
+ value: 12.098
1944
+ - type: recall_at_5
1945
+ value: 17.077
1946
+ - task:
1947
+ type: STS
1948
+ dataset:
1949
+ type: mteb/sickr-sts
1950
+ name: MTEB SICK-R
1951
+ config: default
1952
+ split: test
1953
+ revision: a6ea5a8cab320b040a23452cc28066d9beae2cee
1954
+ metrics:
1955
+ - type: cos_sim_spearman
1956
+ value: 83.9379272617096
1957
+ - task:
1958
+ type: STS
1959
+ dataset:
1960
+ type: mteb/sts12-sts
1961
+ name: MTEB STS12
1962
+ config: default
1963
+ split: test
1964
+ revision: a0d554a64d88156834ff5ae9920b964011b16384
1965
+ metrics:
1966
+ - type: cos_sim_spearman
1967
+ value: 79.26752176661364
1968
+ - task:
1969
+ type: STS
1970
+ dataset:
1971
+ type: mteb/sts13-sts
1972
+ name: MTEB STS13
1973
+ config: default
1974
+ split: test
1975
+ revision: 7e90230a92c190f1bf69ae9002b8cea547a64cca
1976
+ metrics:
1977
+ - type: cos_sim_spearman
1978
+ value: 84.8327309083665
1979
+ - task:
1980
+ type: STS
1981
+ dataset:
1982
+ type: mteb/sts14-sts
1983
+ name: MTEB STS14
1984
+ config: default
1985
+ split: test
1986
+ revision: 6031580fec1f6af667f0bd2da0a551cf4f0b2375
1987
+ metrics:
1988
+ - type: cos_sim_spearman
1989
+ value: 82.9394255552954
1990
+ - task:
1991
+ type: STS
1992
+ dataset:
1993
+ type: mteb/sts15-sts
1994
+ name: MTEB STS15
1995
+ config: default
1996
+ split: test
1997
+ revision: ae752c7c21bf194d8b67fd573edf7ae58183cbe3
1998
+ metrics:
1999
+ - type: cos_sim_spearman
2000
+ value: 88.08995363382608
2001
+ - task:
2002
+ type: STS
2003
+ dataset:
2004
+ type: mteb/sts16-sts
2005
+ name: MTEB STS16
2006
+ config: default
2007
+ split: test
2008
+ revision: 4d8694f8f0e0100860b497b999b3dbed754a0513
2009
+ metrics:
2010
+ - type: cos_sim_spearman
2011
+ value: 86.53522220099619
2012
+ - task:
2013
+ type: STS
2014
+ dataset:
2015
+ type: mteb/sts17-crosslingual-sts
2016
+ name: MTEB STS17 (en-en)
2017
+ config: en-en
2018
+ split: test
2019
+ revision: af5e6fb845001ecf41f4c1e033ce921939a2a68d
2020
+ metrics:
2021
+ - type: cos_sim_spearman
2022
+ value: 89.57796559847532
2023
+ - task:
2024
+ type: STS
2025
+ dataset:
2026
+ type: mteb/sts22-crosslingual-sts
2027
+ name: MTEB STS22 (en)
2028
+ config: en
2029
+ split: test
2030
+ revision: 6d1ba47164174a496b7fa5d3569dae26a6813b80
2031
+ metrics:
2032
+ - type: cos_sim_spearman
2033
+ value: 67.66598855577894
2034
+ - task:
2035
+ type: STS
2036
+ dataset:
2037
+ type: mteb/stsbenchmark-sts
2038
+ name: MTEB STSBenchmark
2039
+ config: default
2040
+ split: test
2041
+ revision: b0fddb56ed78048fa8b90373c8a3cfc37b684831
2042
+ metrics:
2043
+ - type: cos_sim_spearman
2044
+ value: 88.0472708354572
2045
+ - task:
2046
+ type: Reranking
2047
+ dataset:
2048
+ type: mteb/scidocs-reranking
2049
+ name: MTEB SciDocsRR
2050
+ config: default
2051
+ split: test
2052
+ revision: d3c5e1fc0b855ab6097bf1cda04dd73947d7caab
2053
+ metrics:
2054
+ - type: map
2055
+ value: 86.04689157650684
2056
+ - type: mrr
2057
+ value: 96.51889958262507
2058
+ - task:
2059
+ type: Retrieval
2060
+ dataset:
2061
+ type: scifact
2062
+ name: MTEB SciFact
2063
+ config: default
2064
+ split: test
2065
+ revision: None
2066
+ metrics:
2067
+ - type: map_at_1
2068
+ value: 62.827999999999996
2069
+ - type: map_at_10
2070
+ value: 73.54899999999999
2071
+ - type: map_at_100
2072
+ value: 73.892
2073
+ - type: map_at_1000
2074
+ value: 73.901
2075
+ - type: map_at_3
2076
+ value: 70.663
2077
+ - type: map_at_5
2078
+ value: 72.449
2079
+ - type: mrr_at_1
2080
+ value: 66.0
2081
+ - type: mrr_at_10
2082
+ value: 74.554
2083
+ - type: mrr_at_100
2084
+ value: 74.81700000000001
2085
+ - type: mrr_at_1000
2086
+ value: 74.82600000000001
2087
+ - type: mrr_at_3
2088
+ value: 72.667
2089
+ - type: mrr_at_5
2090
+ value: 73.717
2091
+ - type: ndcg_at_1
2092
+ value: 66.0
2093
+ - type: ndcg_at_10
2094
+ value: 78.218
2095
+ - type: ndcg_at_100
2096
+ value: 79.706
2097
+ - type: ndcg_at_1000
2098
+ value: 79.925
2099
+ - type: ndcg_at_3
2100
+ value: 73.629
2101
+ - type: ndcg_at_5
2102
+ value: 75.89
2103
+ - type: precision_at_1
2104
+ value: 66.0
2105
+ - type: precision_at_10
2106
+ value: 10.333
2107
+ - type: precision_at_100
2108
+ value: 1.113
2109
+ - type: precision_at_1000
2110
+ value: 0.11299999999999999
2111
+ - type: precision_at_3
2112
+ value: 28.889
2113
+ - type: precision_at_5
2114
+ value: 19.067
2115
+ - type: recall_at_1
2116
+ value: 62.827999999999996
2117
+ - type: recall_at_10
2118
+ value: 91.533
2119
+ - type: recall_at_100
2120
+ value: 98.333
2121
+ - type: recall_at_1000
2122
+ value: 100.0
2123
+ - type: recall_at_3
2124
+ value: 79.0
2125
+ - type: recall_at_5
2126
+ value: 84.68900000000001
2127
+ - task:
2128
+ type: PairClassification
2129
+ dataset:
2130
+ type: mteb/sprintduplicatequestions-pairclassification
2131
+ name: MTEB SprintDuplicateQuestions
2132
+ config: default
2133
+ split: test
2134
+ revision: d66bd1f72af766a5cc4b0ca5e00c162f89e8cc46
2135
+ metrics:
2136
+ - type: cos_sim_accuracy
2137
+ value: 99.8019801980198
2138
+ - type: cos_sim_ap
2139
+ value: 95.09301057928796
2140
+ - type: cos_sim_f1
2141
+ value: 89.71193415637859
2142
+ - type: cos_sim_precision
2143
+ value: 92.37288135593221
2144
+ - type: cos_sim_recall
2145
+ value: 87.2
2146
+ - type: dot_accuracy
2147
+ value: 99.72079207920792
2148
+ - type: dot_ap
2149
+ value: 92.77707970155015
2150
+ - type: dot_f1
2151
+ value: 85.88588588588588
2152
+ - type: dot_precision
2153
+ value: 85.97194388777555
2154
+ - type: dot_recall
2155
+ value: 85.8
2156
+ - type: euclidean_accuracy
2157
+ value: 99.7980198019802
2158
+ - type: euclidean_ap
2159
+ value: 95.04124481520121
2160
+ - type: euclidean_f1
2161
+ value: 89.61693548387096
2162
+ - type: euclidean_precision
2163
+ value: 90.34552845528455
2164
+ - type: euclidean_recall
2165
+ value: 88.9
2166
+ - type: manhattan_accuracy
2167
+ value: 99.7960396039604
2168
+ - type: manhattan_ap
2169
+ value: 95.02691504694813
2170
+ - type: manhattan_f1
2171
+ value: 89.60321446509292
2172
+ - type: manhattan_precision
2173
+ value: 90.0100908173562
2174
+ - type: manhattan_recall
2175
+ value: 89.2
2176
+ - type: max_accuracy
2177
+ value: 99.8019801980198
2178
+ - type: max_ap
2179
+ value: 95.09301057928796
2180
+ - type: max_f1
2181
+ value: 89.71193415637859
2182
+ - task:
2183
+ type: Clustering
2184
+ dataset:
2185
+ type: mteb/stackexchange-clustering
2186
+ name: MTEB StackExchangeClustering
2187
+ config: default
2188
+ split: test
2189
+ revision: 6cbc1f7b2bc0622f2e39d2c77fa502909748c259
2190
+ metrics:
2191
+ - type: v_measure
2192
+ value: 72.74124969197169
2193
+ - task:
2194
+ type: Clustering
2195
+ dataset:
2196
+ type: mteb/stackexchange-clustering-p2p
2197
+ name: MTEB StackExchangeClusteringP2P
2198
+ config: default
2199
+ split: test
2200
+ revision: 815ca46b2622cec33ccafc3735d572c266efdb44
2201
+ metrics:
2202
+ - type: v_measure
2203
+ value: 32.262798307863996
2204
+ - task:
2205
+ type: Reranking
2206
+ dataset:
2207
+ type: mteb/stackoverflowdupquestions-reranking
2208
+ name: MTEB StackOverflowDupQuestions
2209
+ config: default
2210
+ split: test
2211
+ revision: e185fbe320c72810689fc5848eb6114e1ef5ec69
2212
+ metrics:
2213
+ - type: map
2214
+ value: 54.823414217790464
2215
+ - type: mrr
2216
+ value: 55.557133838383834
2217
+ - task:
2218
+ type: Summarization
2219
+ dataset:
2220
+ type: mteb/summeval
2221
+ name: MTEB SummEval
2222
+ config: default
2223
+ split: test
2224
+ revision: cda12ad7615edc362dbf25a00fdd61d3b1eaf93c
2225
+ metrics:
2226
+ - type: cos_sim_pearson
2227
+ value: 31.01226930465494
2228
+ - type: cos_sim_spearman
2229
+ value: 30.9368445798007
2230
+ - type: dot_pearson
2231
+ value: 30.204833368654533
2232
+ - type: dot_spearman
2233
+ value: 30.438900411966618
2234
+ - task:
2235
+ type: Retrieval
2236
+ dataset:
2237
+ type: trec-covid
2238
+ name: MTEB TRECCOVID
2239
+ config: default
2240
+ split: test
2241
+ revision: None
2242
+ metrics:
2243
+ - type: map_at_1
2244
+ value: 0.22699999999999998
2245
+ - type: map_at_10
2246
+ value: 2.0420000000000003
2247
+ - type: map_at_100
2248
+ value: 13.33
2249
+ - type: map_at_1000
2250
+ value: 33.627
2251
+ - type: map_at_3
2252
+ value: 0.639
2253
+ - type: map_at_5
2254
+ value: 1.056
2255
+ - type: mrr_at_1
2256
+ value: 84.0
2257
+ - type: mrr_at_10
2258
+ value: 91.167
2259
+ - type: mrr_at_100
2260
+ value: 91.167
2261
+ - type: mrr_at_1000
2262
+ value: 91.167
2263
+ - type: mrr_at_3
2264
+ value: 90.667
2265
+ - type: mrr_at_5
2266
+ value: 91.167
2267
+ - type: ndcg_at_1
2268
+ value: 82.0
2269
+ - type: ndcg_at_10
2270
+ value: 80.337
2271
+ - type: ndcg_at_100
2272
+ value: 65.852
2273
+ - type: ndcg_at_1000
2274
+ value: 59.821000000000005
2275
+ - type: ndcg_at_3
2276
+ value: 81.061
2277
+ - type: ndcg_at_5
2278
+ value: 81.396
2279
+ - type: precision_at_1
2280
+ value: 84.0
2281
+ - type: precision_at_10
2282
+ value: 85.0
2283
+ - type: precision_at_100
2284
+ value: 67.75999999999999
2285
+ - type: precision_at_1000
2286
+ value: 26.272000000000002
2287
+ - type: precision_at_3
2288
+ value: 85.333
2289
+ - type: precision_at_5
2290
+ value: 86.4
2291
+ - type: recall_at_1
2292
+ value: 0.22699999999999998
2293
+ - type: recall_at_10
2294
+ value: 2.241
2295
+ - type: recall_at_100
2296
+ value: 16.478
2297
+ - type: recall_at_1000
2298
+ value: 56.442
2299
+ - type: recall_at_3
2300
+ value: 0.672
2301
+ - type: recall_at_5
2302
+ value: 1.143
2303
+ - task:
2304
+ type: Retrieval
2305
+ dataset:
2306
+ type: webis-touche2020
2307
+ name: MTEB Touche2020
2308
+ config: default
2309
+ split: test
2310
+ revision: None
2311
+ metrics:
2312
+ - type: map_at_1
2313
+ value: 1.836
2314
+ - type: map_at_10
2315
+ value: 8.536000000000001
2316
+ - type: map_at_100
2317
+ value: 14.184
2318
+ - type: map_at_1000
2319
+ value: 15.885
2320
+ - type: map_at_3
2321
+ value: 3.7359999999999998
2322
+ - type: map_at_5
2323
+ value: 5.253
2324
+ - type: mrr_at_1
2325
+ value: 22.448999999999998
2326
+ - type: mrr_at_10
2327
+ value: 34.77
2328
+ - type: mrr_at_100
2329
+ value: 36.18
2330
+ - type: mrr_at_1000
2331
+ value: 36.18
2332
+ - type: mrr_at_3
2333
+ value: 30.612000000000002
2334
+ - type: mrr_at_5
2335
+ value: 32.449
2336
+ - type: ndcg_at_1
2337
+ value: 20.408
2338
+ - type: ndcg_at_10
2339
+ value: 20.498
2340
+ - type: ndcg_at_100
2341
+ value: 33.354
2342
+ - type: ndcg_at_1000
2343
+ value: 45.699
2344
+ - type: ndcg_at_3
2345
+ value: 19.292
2346
+ - type: ndcg_at_5
2347
+ value: 19.541
2348
+ - type: precision_at_1
2349
+ value: 22.448999999999998
2350
+ - type: precision_at_10
2351
+ value: 19.387999999999998
2352
+ - type: precision_at_100
2353
+ value: 7.163
2354
+ - type: precision_at_1000
2355
+ value: 1.541
2356
+ - type: precision_at_3
2357
+ value: 19.728
2358
+ - type: precision_at_5
2359
+ value: 20.0
2360
+ - type: recall_at_1
2361
+ value: 1.836
2362
+ - type: recall_at_10
2363
+ value: 15.212
2364
+ - type: recall_at_100
2365
+ value: 45.364
2366
+ - type: recall_at_1000
2367
+ value: 83.64
2368
+ - type: recall_at_3
2369
+ value: 4.651000000000001
2370
+ - type: recall_at_5
2371
+ value: 7.736
2372
+ - task:
2373
+ type: Classification
2374
+ dataset:
2375
+ type: mteb/toxic_conversations_50k
2376
+ name: MTEB ToxicConversationsClassification
2377
+ config: default
2378
+ split: test
2379
+ revision: d7c0de2777da35d6aae2200a62c6e0e5af397c4c
2380
+ metrics:
2381
+ - type: accuracy
2382
+ value: 70.5856
2383
+ - type: ap
2384
+ value: 14.297836125608864
2385
+ - type: f1
2386
+ value: 54.45458507465688
2387
+ - task:
2388
+ type: Classification
2389
+ dataset:
2390
+ type: mteb/tweet_sentiment_extraction
2391
+ name: MTEB TweetSentimentExtractionClassification
2392
+ config: default
2393
+ split: test
2394
+ revision: d604517c81ca91fe16a244d1248fc021f9ecee7a
2395
+ metrics:
2396
+ - type: accuracy
2397
+ value: 61.89869835880024
2398
+ - type: f1
2399
+ value: 62.15163526419782
2400
+ - task:
2401
+ type: Clustering
2402
+ dataset:
2403
+ type: mteb/twentynewsgroups-clustering
2404
+ name: MTEB TwentyNewsgroupsClustering
2405
+ config: default
2406
+ split: test
2407
+ revision: 6125ec4e24fa026cec8a478383ee943acfbd5449
2408
+ metrics:
2409
+ - type: v_measure
2410
+ value: 56.408998393035446
2411
+ - task:
2412
+ type: PairClassification
2413
+ dataset:
2414
+ type: mteb/twittersemeval2015-pairclassification
2415
+ name: MTEB TwitterSemEval2015
2416
+ config: default
2417
+ split: test
2418
+ revision: 70970daeab8776df92f5ea462b6173c0b46fd2d1
2419
+ metrics:
2420
+ - type: cos_sim_accuracy
2421
+ value: 88.78822197055493
2422
+ - type: cos_sim_ap
2423
+ value: 81.73234934293887
2424
+ - type: cos_sim_f1
2425
+ value: 74.16373812312898
2426
+ - type: cos_sim_precision
2427
+ value: 73.18263549961469
2428
+ - type: cos_sim_recall
2429
+ value: 75.17150395778364
2430
+ - type: dot_accuracy
2431
+ value: 87.85837754068069
2432
+ - type: dot_ap
2433
+ value: 79.69812660365871
2434
+ - type: dot_f1
2435
+ value: 72.52999744702579
2436
+ - type: dot_precision
2437
+ value: 70.25222551928783
2438
+ - type: dot_recall
2439
+ value: 74.96042216358839
2440
+ - type: euclidean_accuracy
2441
+ value: 88.74649818203493
2442
+ - type: euclidean_ap
2443
+ value: 81.47777928110055
2444
+ - type: euclidean_f1
2445
+ value: 74.1248097412481
2446
+ - type: euclidean_precision
2447
+ value: 71.37274059599413
2448
+ - type: euclidean_recall
2449
+ value: 77.0976253298153
2450
+ - type: manhattan_accuracy
2451
+ value: 88.7286165583835
2452
+ - type: manhattan_ap
2453
+ value: 81.47766386927232
2454
+ - type: manhattan_f1
2455
+ value: 74.16730231375541
2456
+ - type: manhattan_precision
2457
+ value: 71.56526005888125
2458
+ - type: manhattan_recall
2459
+ value: 76.96569920844327
2460
+ - type: max_accuracy
2461
+ value: 88.78822197055493
2462
+ - type: max_ap
2463
+ value: 81.73234934293887
2464
+ - type: max_f1
2465
+ value: 74.16730231375541
2466
+ - task:
2467
+ type: PairClassification
2468
+ dataset:
2469
+ type: mteb/twitterurlcorpus-pairclassification
2470
+ name: MTEB TwitterURLCorpus
2471
+ config: default
2472
+ split: test
2473
+ revision: 8b6510b0b1fa4e4c4f879467980e9be563ec1cdf
2474
+ metrics:
2475
+ - type: cos_sim_accuracy
2476
+ value: 89.30026778437536
2477
+ - type: cos_sim_ap
2478
+ value: 86.56353001037664
2479
+ - type: cos_sim_f1
2480
+ value: 79.359197907585
2481
+ - type: cos_sim_precision
2482
+ value: 75.12379642365887
2483
+ - type: cos_sim_recall
2484
+ value: 84.10070834616569
2485
+ - type: dot_accuracy
2486
+ value: 88.8539604921023
2487
+ - type: dot_ap
2488
+ value: 85.44601003294055
2489
+ - type: dot_f1
2490
+ value: 78.20008094484713
2491
+ - type: dot_precision
2492
+ value: 74.88549080403072
2493
+ - type: dot_recall
2494
+ value: 81.82168155220204
2495
+ - type: euclidean_accuracy
2496
+ value: 89.25369658865992
2497
+ - type: euclidean_ap
2498
+ value: 86.46965679550075
2499
+ - type: euclidean_f1
2500
+ value: 79.16785612332285
2501
+ - type: euclidean_precision
2502
+ value: 73.77627028465017
2503
+ - type: euclidean_recall
2504
+ value: 85.4096088697259
2505
+ - type: manhattan_accuracy
2506
+ value: 89.26727985407692
2507
+ - type: manhattan_ap
2508
+ value: 86.46460344566123
2509
+ - type: manhattan_f1
2510
+ value: 79.1723543358
2511
+ - type: manhattan_precision
2512
+ value: 74.20875420875421
2513
+ - type: manhattan_recall
2514
+ value: 84.84755158607946
2515
+ - type: max_accuracy
2516
+ value: 89.30026778437536
2517
+ - type: max_ap
2518
+ value: 86.56353001037664
2519
+ - type: max_f1
2520
+ value: 79.359197907585
2521
+ ---
2522
+
2523
+ # LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
2524
+
2525
+ > LLM2Vec is a simple recipe to convert decoder-only LLMs into text encoders. It consists of 3 simple steps: 1) enabling bidirectional attention, 2) masked next token prediction, and 3) unsupervised contrastive learning. The model can be further fine-tuned to achieve state-of-the-art performance.
2526
+ - **Repository:** https://github.com/McGill-NLP/llm2vec
2527
+ - **Paper:** https://arxiv.org/abs/2404.05961
2528
+
2529
+
2530
+ ## Installation
2531
+ ```bash
2532
+ pip install llm2vec
2533
+ ```
2534
+
2535
+ ## Usage
2536
+ ```python
2537
+ from llm2vec import LLM2Vec
2538
+
2539
+ import torch
2540
+ from transformers import AutoTokenizer, AutoModel, AutoConfig
2541
+ from peft import PeftModel
2542
+
2543
+ # Loading base Mistral model, along with custom code that enables bidirectional connections in decoder-only LLMs. MNTP LoRA weights are merged into the base model.
2544
+ tokenizer = AutoTokenizer.from_pretrained(
2545
+ "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp"
2546
+ )
2547
+ config = AutoConfig.from_pretrained(
2548
+ "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp", trust_remote_code=True
2549
+ )
2550
+ model = AutoModel.from_pretrained(
2551
+ "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp",
2552
+ trust_remote_code=True,
2553
+ config=config,
2554
+ torch_dtype=torch.bfloat16,
2555
+ device_map="cuda" if torch.cuda.is_available() else "cpu",
2556
+ )
2557
+ model = PeftModel.from_pretrained(
2558
+ model,
2559
+ "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp",
2560
+ )
2561
+ model = model.merge_and_unload() # This can take several minutes on cpu
2562
+
2563
+ # Loading supervised model. This loads the trained LoRA weights on top of MNTP model. Hence the final weights are -- Base model + MNTP (LoRA) + supervised (LoRA).
2564
+ model = PeftModel.from_pretrained(
2565
+ model, "McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised"
2566
+ )
2567
+
2568
+ # Wrapper for encoding and pooling operations
2569
+ l2v = LLM2Vec(model, tokenizer, pooling_mode="mean", max_length=512)
2570
+
2571
+ # Encoding queries using instructions
2572
+ instruction = (
2573
+ "Given a web search query, retrieve relevant passages that answer the query:"
2574
+ )
2575
+ queries = [
2576
+ [instruction, "how much protein should a female eat"],
2577
+ [instruction, "summit define"],
2578
+ ]
2579
+ q_reps = l2v.encode(queries)
2580
+
2581
+ # Encoding documents. Instruction are not required for documents
2582
+ documents = [
2583
+ "As a general guideline, the CDC's average requirement of protein for women ages 19 to 70 is 46 grams per day. But, as you can see from this chart, you'll need to increase that if you're expecting or training for a marathon. Check out the chart below to see how much protein you should be eating each day.",
2584
+ "Definition of summit for English Language Learners. : 1 the highest point of a mountain : the top of a mountain. : 2 the highest level. : 3 a meeting or series of meetings between the leaders of two or more governments.",
2585
+ ]
2586
+ d_reps = l2v.encode(documents)
2587
+
2588
+ # Compute cosine similarity
2589
+ q_reps_norm = torch.nn.functional.normalize(q_reps, p=2, dim=1)
2590
+ d_reps_norm = torch.nn.functional.normalize(d_reps, p=2, dim=1)
2591
+ cos_sim = torch.mm(q_reps_norm, d_reps_norm.transpose(0, 1))
2592
+
2593
+ print(cos_sim)
2594
+ """
2595
+ tensor([[0.6470, 0.1619],
2596
+ [0.0786, 0.5844]])
2597
+ """
2598
+ ```
2599
+
2600
+ ## Questions
2601
+ If you have any question about the code, feel free to email Parishad (`parishad.behnamghader@mila.quebec`) and Vaibhav (`vaibhav.adlakha@mila.quebec`).
adapter_config.json ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "alpha_pattern": {},
3
+ "auto_mapping": {
4
+ "base_model_class": "LlamaEncoderModel",
5
+ "parent_library": "llama_encoder_model.modeling_llama_encoder"
6
+ },
7
+ "base_model_name_or_path": "6Morpheus6/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp",
8
+ "bias": "none",
9
+ "fan_in_fan_out": false,
10
+ "inference_mode": true,
11
+ "init_lora_weights": true,
12
+ "layers_pattern": null,
13
+ "layers_to_transform": null,
14
+ "loftq_config": {},
15
+ "lora_alpha": 32,
16
+ "lora_dropout": 0.05,
17
+ "megatron_config": null,
18
+ "megatron_core": "megatron.core",
19
+ "modules_to_save": null,
20
+ "peft_type": "LORA",
21
+ "r": 16,
22
+ "rank_pattern": {},
23
+ "revision": null,
24
+ "target_modules": [
25
+ "k_proj",
26
+ "gate_proj",
27
+ "down_proj",
28
+ "up_proj",
29
+ "q_proj",
30
+ "o_proj",
31
+ "v_proj"
32
+ ],
33
+ "task_type": null,
34
+ "use_rslora": false
35
+ }
adapter_model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:53f8f94ebdf396667ba99dd96e78203edae27bbcdbd1cf5f12b611e1d916b225
3
+ size 167829552