MinhPhuc0804 commited on
Commit
6e8fc6e
·
verified ·
1 Parent(s): b6a8f45

Automated push: v3 Trainer, 256-seq, dev loss active

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "embedding_dimension": 1024,
3
+ "pooling_mode": "mean",
4
+ "include_prompt": true
5
+ }
README.md ADDED
@@ -0,0 +1,719 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - generated_from_trainer
7
+ - dataset_size:18281
8
+ - loss:MultipleNegativesRankingLoss
9
+ base_model: intfloat/multilingual-e5-large-instruct
10
+ widget:
11
+ - source_sentence: 'query: Travel restrictions showed up after the fact, but physical
12
+ distancing, case isolation, & testing did the trick for #FlattenTheCurve. Our
13
+ global crew''s analysis (co-led by @user & @user) of data from >30,000 #COVID19
14
+ cases is out in @ScienceMagazine. 1/12'
15
+ sentences:
16
+ - 'passage: title: The effect of human mobility and control measures on the COVID-19
17
+ epidemic in China
18
+
19
+ abstract: The ongoing coronavirus disease 2019 (COVID-19) outbreak expanded rapidly
20
+ throughout China. Major behavioral, clinical, and state interventions were undertaken
21
+ to mitigate the epidemic and prevent the persistence of the virus in human populations
22
+ in China and worldwide. It remains unclear how these unprecedented interventions,
23
+ including travel restrictions, affected COVID-19 spread in China. We used real-time
24
+ mobility data from Wuhan and detailed case data including travel history to elucidate
25
+ the role of case importation in transmission in cities across China and to ascertain
26
+ the impact of control measures. Early on, the spatial distribution of COVID-19
27
+ cases in China was explained well by human mobility data. After the implementation
28
+ of control measures, this correlation dropped and growth rates became negative
29
+ in most locations, although shifts in the demographics of reported cases were
30
+ still indicative of local chains of transmission outside of Wuhan. This study
31
+ shows that the drastic control measures implemented in China substantially mitigated
32
+ the spread of COVID-19.'
33
+ - 'passage: title: Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome
34
+ of cultured human cells and can be expressed in patient-derived tissues
35
+
36
+ abstract: Prolonged detection of severe acute respiratory syndrome coronavirus
37
+ 2 (SARS-CoV-2) RNA and recurrence of PCR-positive tests have been widely reported
38
+ in patients after recovery from COVID-19, but some of these patients do not appear
39
+ to shed infectious virus. We investigated the possibility that SARS-CoV-2 RNAs
40
+ can be reverse-transcribed and integrated into the DNA of human cells in culture
41
+ and that transcription of the integrated sequences might account for some of the
42
+ positive PCR tests seen in patients. In support of this hypothesis, we found that
43
+ DNA copies of SARS-CoV-2 sequences can be integrated into the genome of infected
44
+ human cells.'
45
+ - 'passage: title: Covid-19 and alcohol—a dangerous cocktail
46
+
47
+ abstract: The principal aim of this work was to better understand how regenerating
48
+ muscle fibers become innervated in adult animals. To induce muscle regeneration,
49
+ individual identified muscle fibers in a mouse were damaged with a laser focused
50
+ through a microscope. The muscle fiber that degenerated and the muscle fiber that
51
+ was formed in its place were followed by viewing the same site repeatedly over
52
+ a period of 2 d to 40 weeks. Commonly, the nerve terminal innervating the irradiated
53
+ muscle fiber partially retracted during muscle fiber degeneration, and then sprouted
54
+ to innervate the regenerating muscle fiber at the same site it had previously
55
+ innervated the muscle fiber that was damaged. During the early phase of muscle
56
+ regeneration we also observed sprouts originating from nerve terminals on adjacent
57
+ muscle fibers. The new nerve growth was a response to the regenerating muscle
58
+ fiber rather than to the degenerated fiber it replaced because repeated damage
59
+ of the same site every 2–3 d over a 10 d period (to prevent regeneration) did
60
+ not cause any sprouting.'
61
+ - source_sentence: 'query: Check out this #preprint on @researchsquare: Vaccine‑driven
62
+ immunity offers stronger cross‑type immunity versus natural infection against
63
+ emerging SARS‑CoV‑2 variants of concern.'
64
+ sentences:
65
+ - 'passage: title: Vaccine-induced immunity provides more robust heterotypic immunity
66
+ than natural infection to emerging SARS-CoV-2 variants of concern.
67
+
68
+ abstract: <title>Abstract</title> Both natural infection with SARS-CoV-2 and immunization
69
+ with a number of vaccines induce protective immunity. However, the ability of
70
+ such immune responses to recognize and therefore protect against emerging variants
71
+ is a matter of increasing importance. Such variants of concern (VOC) include isolates
72
+ of lineage B1.1.7, first identified in the UK, and B1.351, first identified in
73
+ South Africa. Our data confirm that VOC, particularly those with substitutions
74
+ at residues 484 and 417 escape neutralization by antibodies directed to the ACE2-binding
75
+ Class 1 and the adjacent Class 2 epitopes but are susceptible to neutralization
76
+ by the generally less potent antibodies directed to Class 3 and 4 epitopes on
77
+ the flanks RBD.'
78
+ - 'passage: title: Optic neuritis following COVID-19 vaccination: Coincidence or
79
+ side-effect? - A case series
80
+
81
+ abstract: The whole world waiting for the elimination of COVID-19. This is a short
82
+ series of three cases that presented with optic neuritis. On further inquiry,
83
+ all had received the Covishield vaccine within 5-12 days just before the presentation,
84
+ with no history of COVID-19 positive RT-PCR. The range of age was 27-48 years.
85
+ All patients improved after pulse steroid therapy and are still under follow-up.
86
+ After being plagued by COVID-19 for nearly 2 years, the whole world wishes for
87
+ little more than complete eradication of the disease. Our country commenced the
88
+ much-awaited vaccination drive from Jan 2021. Ophthalmic manifestations have appeared
89
+ in many forms post-COVID-19, among which neuro-ophthalmic manifestations are infrequent.
90
+ To the best of our knowledge, this is the first report of a short case series
91
+ from our country presenting with optic neuritis after COVID-19 vaccination, without
92
+ any sign of active infection.'
93
+ - 'passage: title: Circulating Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2)
94
+ Vaccine Antigen Detected in the Plasma of mRNA-1273 Vaccine Recipients
95
+
96
+ abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) proteins
97
+ were measured in longitudinal plasma samples collected from 13 participants who
98
+ received two doses of mRNA-1273 vaccine. Eleven of 13 participants showed detectable
99
+ levels of SARS-CoV-2 protein as early as day 1 after first vaccine injection.
100
+ Clearance of detectable SARS-CoV-2 protein correlated with production of immunoglobulin
101
+ G (IgG) and immunoglobulin A (IgA).'
102
+ - source_sentence: 'query: WHO is missing? Embarek Obviously this daft narrow‑minded
103
+ team of scientific puppets complaining about "time windows" are irate about his
104
+ recent Danish interview Origins of SARS‑CoV‑2: window is closing for key scientific
105
+ studies'
106
+ sentences:
107
+ - 'passage: -2 cycle threshold values during infection (i.e. lower viral burden),
108
+ and less frequently reported any symptoms.
109
+
110
+
111
+ title: Anti-spike antibody response to natural SARS-CoV-2 infection in the general
112
+ population
113
+
114
+ Among those who seroconverted, using Bayesian linear mixed models, the estimated
115
+ anti-spike IgG peak level was 7.3-fold higher than the level previously associated
116
+ with 50% protection against reinfection, with higher peak levels in older participants
117
+ and those of non-white ethnicity. The estimated anti-spike IgG half-life was 184
118
+ days, being longer in females and those of white ethnicity. We estimated antibody
119
+ levels associated with protection against reinfection likely last 1.5-2 years
120
+ on average, with levels associated with protection from severe infection present
121
+ for several years. These estimates could inform planning for vaccination booster
122
+ strategies.'
123
+ - 'passage: title: Trends in Income From 1975 to 2018
124
+
125
+ abstract: For two decades after World War II, incomes grew at a rate close to
126
+ the U.S. economy-wide growth rate. Anemic growth from 1969 to 1974 kept inequality
127
+ in check. But since then, the benefits of growth have not been evenly distributed.
128
+ Racial and gender inequality is also manifested in income inequality.'
129
+ - 'passage: title: Origins of SARS-CoV-2: window is closing for key scientific studies
130
+
131
+ abstract: Authors of the March WHO report into how COVID-19 emerged warn that
132
+ further delay makes crucial inquiry biologically difficult. Authors of the March
133
+ WHO report into how COVID-19 emerged warn that further delay makes crucial inquiry
134
+ biologically difficult.'
135
+ - source_sentence: 'query: Hartklachten en vaccinaties The chance of myocarditis after
136
+ vaccination is steadily higher in younger men, especially after a 2nd dose of
137
+ RNA mRNA-1273 vaccine Chance of myocarditis after sequential COVID-19 vaccinations
138
+ by age and gender'
139
+ sentences:
140
+ - 'passage: title: Risk of myocarditis following sequential COVID-19 vaccinations
141
+ by age and sex
142
+
143
+ abstract: ABSTRACT In an updated self-controlled case series analysis of 42,200,614
144
+ people aged 13 years or more, we evaluate the association between COVID-19 vaccination
145
+ and myocarditis, stratified by age and sex, including 10,978,507 people receiving
146
+ a third vaccine dose. Myocarditis risk was increased during 1-28 days following
147
+ a third dose of BNT162b2 (IRR 2.02, 95%CI 1.40, 2.91).'
148
+ - 'passage: in) and faster viral clearance by PCR. Viral clearance was treatment
149
+ dose- and duration-dependent.
150
+
151
+
152
+ title: Meta-analysis of randomized trials of ivermectin to treat SARS-CoV-2 infection
153
+
154
+ In six randomized trials of moderate or severe infection, there was a 75% reduction
155
+ in mortality (Relative Risk=0.25 [95%CI 0.12-0.52]; p=0.0002); 14/650 (2.1%) deaths
156
+ on ivermectin; 57/597 (9.5%) deaths in controls) with favorable clinical recovery
157
+ and reduced hospitalization. Many studies included were not peer reviewed and
158
+ meta-analyses are prone to confounding issues. Ivermectin should be validated
159
+ in larger, appropriately controlled randomized trials before the results are sufficient
160
+ for review by regulatory authorities.'
161
+ - 'passage: title: Asymptomatic transmission of covid-19
162
+
163
+ abstract: The UK''s £100bn "Operation Moonshot" to roll out mass testing for covid-19
164
+ to cities and universities around the country raises two key questions.How infectious
165
+ are people who test positive but have no symptoms?And, what is their contribution
166
+ to transmission of live virus?'
167
+ - source_sentence: 'query: @user That’s not what some of the data suggests. 25% of
168
+ those who had a light infection the first time required an ER visit the 2nd time.
169
+ And we know there is a mounting load with reinfections.'
170
+ sentences:
171
+ - 'passage: title: Ce que les sondages font à l''opinion publique
172
+
173
+ abstract: Ce que les sondages font à l''opinion. Loïc Blondiaux [117-136]. Cet
174
+ article se propose de revisiter la controverse récurrente autour des sondages
175
+ et de l''opinion qui traverse les sciences sociales et divise en particulier la
176
+ science politique. Il commence par recenser les principales critiques adressées
177
+ aux sondages d''opinion dans la sociologie et la science politique française et
178
+ anglo-saxonne. Il tente ensuite de reconstituer une brève histoire des usages
179
+ du concept d''opinion dans le discours savant. La conclusion de ce double inventaire
180
+ apparaît sans ambiguïté : les sondages ne mesurent pas l''opinion publique au
181
+ sens où les sciences sociales et le discours politique savant entendent habituellement
182
+ cette notion. La troisième et dernière partie discute plusieurs hypothèses susceptibles
183
+ de rendre compte de l''extraordinaire réussite de cette étrange mesure de l''opinion
184
+ publique.'
185
+ - 'passage: randomization to hospital discharge.
186
+
187
+
188
+ title: Effect of a Single High Dose of Vitamin D<sub>3</sub> on Hospital Length
189
+ of Stay in Patients With Moderate to Severe COVID-19
190
+
191
+ Prespecified secondary outcomes included mortality during hospitalization; the
192
+ number of patients admitted to the intensive care unit; the number of patients
193
+ who required mechanical ventilation and the duration of mechanical ventilation;
194
+ and serum levels of 25-hydroxyvitamin D, total calcium, creatinine, and C-reactive
195
+ protein.Of 240 randomized patients, 237 were included in the primary analysis
196
+ (mean [SD] age, 56.2 [14.4] years; 104 [43.9%] women; mean [SD] baseline 25-hydroxyvitamin
197
+ D level, 20.9 [9.2] ng/mL). Median (interquartile range) length of stay was not
198
+ significantly different between the vitamin D3 (7.0 [4.0-10.0] days) and placebo
199
+ groups (7.0 [5.0-13.0] days) (log-rank P = .59; unadjusted hazard ratio for hospital
200
+ discharge, 1.07 [95% CI, 0.82-1.39]; P = .62).'
201
+ - 'passage: diagnoses occur closer to the index date for infection or reinfection
202
+ in the Omicron BA epoch.
203
+
204
+
205
+ title: SARS-CoV-2 Reinfection is Preceded by Unique Biomarkers and Related to
206
+ Initial Infection Timing and Severity: an N3C RECOVER EHR-Based Cohort Study
207
+
208
+ We report lower albumin levels leading up to reinfection and a statistically significant
209
+ association of severity between first infection and reinfection (chi-squared value:
210
+ 9446.2, p-value: 0) with a medium effect size (Cramer''s V: 0.18, DoF = 4).'
211
+ pipeline_tag: sentence-similarity
212
+ library_name: sentence-transformers
213
+ metrics:
214
+ - cosine_accuracy@1
215
+ - cosine_accuracy@3
216
+ - cosine_accuracy@5
217
+ - cosine_accuracy@10
218
+ - cosine_precision@1
219
+ - cosine_precision@3
220
+ - cosine_precision@5
221
+ - cosine_precision@10
222
+ - cosine_recall@1
223
+ - cosine_recall@3
224
+ - cosine_recall@5
225
+ - cosine_recall@10
226
+ - cosine_ndcg@10
227
+ - cosine_mrr@10
228
+ - cosine_map@100
229
+ model-index:
230
+ - name: SentenceTransformer based on intfloat/multilingual-e5-large-instruct
231
+ results:
232
+ - task:
233
+ type: information-retrieval
234
+ name: Information Retrieval
235
+ dataset:
236
+ name: CT26 dev split
237
+ type: CT26-dev-split
238
+ metrics:
239
+ - type: cosine_accuracy@1
240
+ value: 0.6926272066458983
241
+ name: Cosine Accuracy@1
242
+ - type: cosine_accuracy@3
243
+ value: 0.857736240913811
244
+ name: Cosine Accuracy@3
245
+ - type: cosine_accuracy@5
246
+ value: 0.8982346832814122
247
+ name: Cosine Accuracy@5
248
+ - type: cosine_accuracy@10
249
+ value: 0.9273104880581516
250
+ name: Cosine Accuracy@10
251
+ - type: cosine_precision@1
252
+ value: 0.6926272066458983
253
+ name: Cosine Precision@1
254
+ - type: cosine_precision@3
255
+ value: 0.28591208030460363
256
+ name: Cosine Precision@3
257
+ - type: cosine_precision@5
258
+ value: 0.17964693665628245
259
+ name: Cosine Precision@5
260
+ - type: cosine_precision@10
261
+ value: 0.09273104880581517
262
+ name: Cosine Precision@10
263
+ - type: cosine_recall@1
264
+ value: 0.6926272066458983
265
+ name: Cosine Recall@1
266
+ - type: cosine_recall@3
267
+ value: 0.857736240913811
268
+ name: Cosine Recall@3
269
+ - type: cosine_recall@5
270
+ value: 0.8982346832814122
271
+ name: Cosine Recall@5
272
+ - type: cosine_recall@10
273
+ value: 0.9273104880581516
274
+ name: Cosine Recall@10
275
+ - type: cosine_ndcg@10
276
+ value: 0.8179831495116502
277
+ name: Cosine Ndcg@10
278
+ - type: cosine_mrr@10
279
+ value: 0.7820204717400984
280
+ name: Cosine Mrr@10
281
+ - type: cosine_map@100
282
+ value: 0.7847045028904378
283
+ name: Cosine Map@100
284
+ ---
285
+
286
+ # SentenceTransformer based on intfloat/multilingual-e5-large-instruct
287
+
288
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [intfloat/multilingual-e5-large-instruct](https://huggingface.co/intfloat/multilingual-e5-large-instruct) on the clef-me5-mined-pairs-train-pairs dataset. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for retrieval.
289
+
290
+ ## Model Details
291
+
292
+ ### Model Description
293
+ - **Model Type:** Sentence Transformer
294
+ - **Base model:** [intfloat/multilingual-e5-large-instruct](https://huggingface.co/intfloat/multilingual-e5-large-instruct) <!-- at revision 274baa43b0e13e37fafa6428dbc7938e62e5c439 -->
295
+ - **Maximum Sequence Length:** 256 tokens
296
+ - **Output Dimensionality:** 1024 dimensions
297
+ - **Similarity Function:** Cosine Similarity
298
+ - **Supported Modality:** Text
299
+ - **Training Dataset:**
300
+ - clef-me5-mined-pairs-train-pairs
301
+ <!-- - **Language:** Unknown -->
302
+ <!-- - **License:** Unknown -->
303
+
304
+ ### Model Sources
305
+
306
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
307
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
308
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
309
+
310
+ ### Full Model Architecture
311
+
312
+ ```
313
+ SentenceTransformer(
314
+ (0): Transformer({'transformer_task': 'feature-extraction', 'modality_config': {'text': {'method': 'forward', 'method_output_name': 'last_hidden_state'}}, 'module_output_name': 'token_embeddings', 'architecture': 'XLMRobertaModel'})
315
+ (1): Pooling({'embedding_dimension': 1024, 'pooling_mode': 'mean', 'include_prompt': True})
316
+ (2): Normalize({})
317
+ )
318
+ ```
319
+
320
+ ## Usage
321
+
322
+ ### Direct Usage (Sentence Transformers)
323
+
324
+ First install the Sentence Transformers library:
325
+
326
+ ```bash
327
+ pip install -U sentence-transformers
328
+ ```
329
+ Then you can load this model and run inference.
330
+ ```python
331
+ from sentence_transformers import SentenceTransformer
332
+
333
+ # Download from the 🤗 Hub
334
+ model = SentenceTransformer("MinhPhuc0804/me5-256-kiem-tra-di-t1-v2.2-epoch-10")
335
+ # Run inference
336
+ sentences = [
337
+ 'query: @user That’s not what some of the data suggests. 25% of those who had a light infection the first time required an ER visit the 2nd time. And we know there is a mounting load with reinfections.',
338
+ "passage: diagnoses occur closer to the index date for infection or reinfection in the Omicron BA epoch.\n\ntitle: SARS-CoV-2 Reinfection is Preceded by Unique Biomarkers and Related to Initial Infection Timing and Severity: an N3C RECOVER EHR-Based Cohort Study\nWe report lower albumin levels leading up to reinfection and a statistically significant association of severity between first infection and reinfection (chi-squared value: 9446.2, p-value: 0) with a medium effect size (Cramer's V: 0.18, DoF = 4).",
339
+ 'passage: randomization to hospital discharge.\n\ntitle: Effect of a Single High Dose of Vitamin D<sub>3</sub> on Hospital Length of Stay in Patients With Moderate to Severe COVID-19\nPrespecified secondary outcomes included mortality during hospitalization; the number of patients admitted to the intensive care unit; the number of patients who required mechanical ventilation and the duration of mechanical ventilation; and serum levels of 25-hydroxyvitamin D, total calcium, creatinine, and C-reactive protein.Of 240 randomized patients, 237 were included in the primary analysis (mean [SD] age, 56.2 [14.4] years; 104 [43.9%] women; mean [SD] baseline 25-hydroxyvitamin D level, 20.9 [9.2] ng/mL). Median (interquartile range) length of stay was not significantly different between the vitamin D3 (7.0 [4.0-10.0] days) and placebo groups (7.0 [5.0-13.0] days) (log-rank P = .59; unadjusted hazard ratio for hospital discharge, 1.07 [95% CI, 0.82-1.39]; P = .62).',
340
+ ]
341
+ embeddings = model.encode(sentences)
342
+ print(embeddings.shape)
343
+ # [3, 1024]
344
+
345
+ # Get the similarity scores for the embeddings
346
+ similarities = model.similarity(embeddings, embeddings)
347
+ print(similarities)
348
+ # tensor([[1.0000, 0.5041, 0.1662],
349
+ # [0.5041, 1.0000, 0.0334],
350
+ # [0.1662, 0.0334, 1.0000]])
351
+ ```
352
+ <!--
353
+ ### Direct Usage (Transformers)
354
+
355
+ <details><summary>Click to see the direct usage in Transformers</summary>
356
+
357
+ </details>
358
+ -->
359
+
360
+ <!--
361
+ ### Downstream Usage (Sentence Transformers)
362
+
363
+ You can finetune this model on your own dataset.
364
+
365
+ <details><summary>Click to expand</summary>
366
+
367
+ </details>
368
+ -->
369
+
370
+ <!--
371
+ ### Out-of-Scope Use
372
+
373
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
374
+ -->
375
+
376
+ ## Evaluation
377
+
378
+ ### Metrics
379
+
380
+ #### Information Retrieval
381
+
382
+ * Dataset: `CT26-dev-split`
383
+ * Evaluated with [<code>InformationRetrievalEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.sentence_transformer.evaluation.InformationRetrievalEvaluator)
384
+
385
+ | Metric | Value |
386
+ |:--------------------|:----------|
387
+ | cosine_accuracy@1 | 0.6926 |
388
+ | cosine_accuracy@3 | 0.8577 |
389
+ | cosine_accuracy@5 | 0.8982 |
390
+ | cosine_accuracy@10 | 0.9273 |
391
+ | cosine_precision@1 | 0.6926 |
392
+ | cosine_precision@3 | 0.2859 |
393
+ | cosine_precision@5 | 0.1796 |
394
+ | cosine_precision@10 | 0.0927 |
395
+ | cosine_recall@1 | 0.6926 |
396
+ | cosine_recall@3 | 0.8577 |
397
+ | cosine_recall@5 | 0.8982 |
398
+ | cosine_recall@10 | 0.9273 |
399
+ | **cosine_ndcg@10** | **0.818** |
400
+ | cosine_mrr@10 | 0.782 |
401
+ | cosine_map@100 | 0.7847 |
402
+
403
+ <!--
404
+ ## Bias, Risks and Limitations
405
+
406
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
407
+ -->
408
+
409
+ <!--
410
+ ### Recommendations
411
+
412
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
413
+ -->
414
+
415
+ ## Training Details
416
+
417
+ ### Training Dataset
418
+
419
+ #### clef-me5-mined-pairs-train-pairs
420
+
421
+ * Dataset: clef-me5-mined-pairs-train-pairs
422
+ * Size: 18,281 training samples
423
+ * Columns: <code>anchor</code> and <code>positive</code>
424
+ * Approximate statistics based on the first 1000 samples:
425
+ | | anchor | positive |
426
+ |:--------|:------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
427
+ | type | string | string |
428
+ | details | <ul><li>min: 26 tokens</li><li>mean: 59.43 tokens</li><li>max: 104 tokens</li></ul> | <ul><li>min: 26 tokens</li><li>mean: 190.97 tokens</li><li>max: 256 tokens</li></ul> |
429
+ * Samples:
430
+ | anchor | positive |
431
+ |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
432
+ | <code>query: Peer-reviewed scientific studies project universal mask use would spare nearly 130,000 U.S. lives by February. But Utah carries on with a baffling, lax patchwork of feeble mask rules. If elected, I’ll shield your family. #PetersonProtects #utpol</code> | <code>passage: 469,578–578,347) lives could be lost to COVID-19 across the United States by 28 February 2021.<br><br>title: Modeling COVID-19 scenarios for the United States<br>We find that achieving universal mask use (95% mask use in public) could be sufficient to ameliorate the worst effects of epidemic resurgences in many states. Universal mask use could save an additional 129,574 (85,284–170,867) lives from September 22, 2020 through the end of February 2021, or an additional 95,814 (60,731–133,077) lives assuming a lesser adoption of mask wearing (85%), when compared to the reference scenario.</code> |
433
+ | <code>query: "Seroprevalence studies may fail to detect people who have had mild covid-19. Consideration should be given to [...] calibration of assay thresholds, the breadth of the antibody response, and the role of mucosal antibodies"</code> | <code>passage: was used to analyse the data.<br><br>title: Are we underestimating seroprevalence of SARS-CoV-2?<br><h3>Results</h3> The study indicates a fairly high knowledge level of STI/HIV (89.4%) with more males (87%) and younger participants (88%) possessing good knowledge of STI/HIV. Majority of the participants are sexually active (63.3%) and of this are more males (61.3%) and younger participants (60%). Findings show that age (β = 0.025; t = 0.04; p &gt; 0.05) of the three predictor variables was not a predictor of attitude towards STI/HIV. However, knowledge of STI/HIV (β = 0.459; t = 5.032; p &lt; 0.05) and sexual behaviour (β = 0.341; t = 4.278; p &lt; 0.05) were strong predictors of attitude towards STI/HIV. <h3>Conclusion</h3> This study shows the need for strong advocacy, enlightenment and community mobilisation for improved awareness of STI/HIV.</code> |
434
+ | <code>query: Rebuilding of October #Arctic sea ice volume spanning over the past 100 years... [Side-by-side look between PIOMAS-20C and PIOMAS data sets now refreshed through October 2021. Model details available at</code> | <code>passage: title: Arctic Sea Ice Volume Variability over 1901–2010: A Model-Based Reconstruction
435
+ abstract: Abstract PIOMAS-20C, an Arctic sea ice reconstruction for 1901–2010, is produced by forcing the Pan-Arctic Ice Ocean Modeling and Assimilation System (PIOMAS) with ERA-20C atmospheric data. ERA-20C performance over Arctic sea ice is assessed by comparisons with measurements and data from other reanalyses. ERA-20C performs similarly with respect to the annual cycle of downwelling radiation, air temperature, and wind speed compared to reanalyses with more extensive data assimilation such as ERA-Interim and MERRA. PIOMAS-20C sea ice thickness and volume are then compared with in situ and aircraft remote sensing observations for the period of ~1950–2010. Error statistics are similar to those for PIOMAS. We compare the magnitude and patterns of sea ice variability between the first half of the twentieth century (1901–40) and the more recent period (1980–2010), both marked by sea ice decl...</code> |
436
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
437
+ ```json
438
+ {
439
+ "scale": 20.0,
440
+ "similarity_fct": "cos_sim",
441
+ "gather_across_devices": false,
442
+ "directions": [
443
+ "query_to_doc"
444
+ ],
445
+ "partition_mode": "joint",
446
+ "hardness_mode": null,
447
+ "hardness_strength": 0.0
448
+ }
449
+ ```
450
+
451
+ ### Evaluation Dataset
452
+
453
+ #### clef-me5-mined-pairs-train-pairs
454
+
455
+ * Dataset: clef-me5-mined-pairs-train-pairs
456
+ * Size: 963 evaluation samples
457
+ * Columns: <code>anchor</code> and <code>positive</code>
458
+ * Approximate statistics based on the first 963 samples:
459
+ | | anchor | positive |
460
+ |:--------|:------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
461
+ | type | string | string |
462
+ | details | <ul><li>min: 24 tokens</li><li>mean: 59.09 tokens</li><li>max: 138 tokens</li></ul> | <ul><li>min: 30 tokens</li><li>mean: 189.75 tokens</li><li>max: 256 tokens</li></ul> |
463
+ * Samples:
464
+ | anchor | positive |
465
+ |:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
466
+ | <code>query: I reckon we’ll be hearing more about plitidepsin, which (in vitro, at least) is 27.5 times stronger than remdesivir #COVID19</code> | <code>passage: title: Plitidepsin has potent preclinical efficacy against SARS-CoV-2 by targeting the host protein eEF1A<br>abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) viral proteins interact with the eukaryotic translation machinery, and inhibitors of translation have potent antiviral effects. We found that the drug plitidepsin (aplidin), which has limited clinical approval, possesses antiviral activity (90% inhibitory concentration = 0.88 nM) that is more potent than remdesivir against SARS-CoV-2 in vitro by a factor of 27.5, with limited toxicity in cell culture. Through the use of a drug-resistant mutant, we show that the antiviral activity of plitidepsin against SARS-CoV-2 is mediated through inhibition of the known target eEF1A (eukaryotic translation elongation factor 1A).</code> |
467
+ | <code>query: 2020 research on #LongCovid. Pre‑vaccine era. “Young, low risk patients with ongoing symptoms of #covid19 had signs of damage to multiple organs four months after initially being infected.”</code> | <code>passage: title: Long covid: Damage to multiple organs presents in young, low risk patients<br>abstract: Young, low risk patients with ongoing symptoms of covid-19 had signs of damage to multiple organs four months after initially being infected, a preprint study has suggested.</code> |
468
+ | <code>query: L'inflammation indépendante provoquée par les macrophages encourage-t-elle les lésions alvéolaires dans la COVID-19 ?</code> | <code>passage: title: Does autonomous macrophage-driven inflammation promote alveolar damage in COVID-19?<br>abstract: <b>The editorial reviews an <i>ERJ</i> publication which shows direct viral replication is rare in the alveolar space due to rare ACE2 expression. Instead it posits that autonomous macrophage inflammation occurs and drives lung injury.</b>https://bit.ly/3CqjwiT</code> |
469
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
470
+ ```json
471
+ {
472
+ "scale": 20.0,
473
+ "similarity_fct": "cos_sim",
474
+ "gather_across_devices": false,
475
+ "directions": [
476
+ "query_to_doc"
477
+ ],
478
+ "partition_mode": "joint",
479
+ "hardness_mode": null,
480
+ "hardness_strength": 0.0
481
+ }
482
+ ```
483
+
484
+ ### Training Hyperparameters
485
+ #### Non-Default Hyperparameters
486
+
487
+ - `per_device_train_batch_size`: 64
488
+ - `per_device_eval_batch_size`: 64
489
+ - `learning_rate`: 1.6e-05
490
+ - `num_train_epochs`: 10
491
+ - `warmup_ratio`: 0.1
492
+ - `fp16`: True
493
+ - `dataloader_num_workers`: 8
494
+
495
+ #### All Hyperparameters
496
+ <details><summary>Click to expand</summary>
497
+
498
+ - `overwrite_output_dir`: False
499
+ - `do_predict`: False
500
+ - `prediction_loss_only`: True
501
+ - `per_device_train_batch_size`: 64
502
+ - `per_device_eval_batch_size`: 64
503
+ - `per_gpu_train_batch_size`: None
504
+ - `per_gpu_eval_batch_size`: None
505
+ - `gradient_accumulation_steps`: 1
506
+ - `eval_accumulation_steps`: None
507
+ - `torch_empty_cache_steps`: None
508
+ - `learning_rate`: 1.6e-05
509
+ - `weight_decay`: 0.0
510
+ - `adam_beta1`: 0.9
511
+ - `adam_beta2`: 0.999
512
+ - `adam_epsilon`: 1e-08
513
+ - `max_grad_norm`: 1.0
514
+ - `num_train_epochs`: 10
515
+ - `max_steps`: -1
516
+ - `lr_scheduler_type`: linear
517
+ - `lr_scheduler_kwargs`: {}
518
+ - `warmup_ratio`: 0.1
519
+ - `warmup_steps`: 0
520
+ - `log_level`: passive
521
+ - `log_level_replica`: warning
522
+ - `log_on_each_node`: True
523
+ - `logging_nan_inf_filter`: True
524
+ - `save_safetensors`: True
525
+ - `save_on_each_node`: False
526
+ - `save_only_model`: False
527
+ - `restore_callback_states_from_checkpoint`: False
528
+ - `no_cuda`: False
529
+ - `use_cpu`: False
530
+ - `use_mps_device`: False
531
+ - `seed`: 42
532
+ - `data_seed`: None
533
+ - `jit_mode_eval`: False
534
+ - `use_ipex`: False
535
+ - `bf16`: False
536
+ - `fp16`: True
537
+ - `fp16_opt_level`: O1
538
+ - `half_precision_backend`: auto
539
+ - `bf16_full_eval`: False
540
+ - `fp16_full_eval`: False
541
+ - `tf32`: None
542
+ - `local_rank`: 0
543
+ - `ddp_backend`: None
544
+ - `tpu_num_cores`: None
545
+ - `tpu_metrics_debug`: False
546
+ - `debug`: []
547
+ - `dataloader_drop_last`: False
548
+ - `dataloader_num_workers`: 8
549
+ - `dataloader_prefetch_factor`: None
550
+ - `past_index`: -1
551
+ - `disable_tqdm`: False
552
+ - `remove_unused_columns`: True
553
+ - `label_names`: None
554
+ - `load_best_model_at_end`: False
555
+ - `ignore_data_skip`: False
556
+ - `fsdp`: []
557
+ - `fsdp_min_num_params`: 0
558
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
559
+ - `fsdp_transformer_layer_cls_to_wrap`: None
560
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
561
+ - `parallelism_config`: None
562
+ - `deepspeed`: None
563
+ - `label_smoothing_factor`: 0.0
564
+ - `optim`: adamw_torch_fused
565
+ - `optim_args`: None
566
+ - `adafactor`: False
567
+ - `group_by_length`: False
568
+ - `length_column_name`: length
569
+ - `ddp_find_unused_parameters`: None
570
+ - `ddp_bucket_cap_mb`: None
571
+ - `ddp_broadcast_buffers`: False
572
+ - `dataloader_pin_memory`: True
573
+ - `dataloader_persistent_workers`: False
574
+ - `skip_memory_metrics`: True
575
+ - `use_legacy_prediction_loop`: False
576
+ - `push_to_hub`: False
577
+ - `resume_from_checkpoint`: None
578
+ - `hub_model_id`: None
579
+ - `hub_strategy`: every_save
580
+ - `hub_private_repo`: None
581
+ - `hub_always_push`: False
582
+ - `hub_revision`: None
583
+ - `gradient_checkpointing`: False
584
+ - `gradient_checkpointing_kwargs`: None
585
+ - `include_inputs_for_metrics`: False
586
+ - `include_for_metrics`: []
587
+ - `eval_do_concat_batches`: True
588
+ - `fp16_backend`: auto
589
+ - `push_to_hub_model_id`: None
590
+ - `push_to_hub_organization`: None
591
+ - `mp_parameters`:
592
+ - `auto_find_batch_size`: False
593
+ - `full_determinism`: False
594
+ - `torchdynamo`: None
595
+ - `ray_scope`: last
596
+ - `ddp_timeout`: 1800
597
+ - `torch_compile`: False
598
+ - `torch_compile_backend`: None
599
+ - `torch_compile_mode`: None
600
+ - `include_tokens_per_second`: False
601
+ - `include_num_input_tokens_seen`: False
602
+ - `neftune_noise_alpha`: None
603
+ - `optim_target_modules`: None
604
+ - `batch_eval_metrics`: False
605
+ - `eval_on_start`: False
606
+ - `use_liger_kernel`: False
607
+ - `liger_kernel_config`: None
608
+ - `eval_use_gather_object`: False
609
+ - `average_tokens_across_devices`: False
610
+ - `prompts`: None
611
+ - `batch_sampler`: batch_sampler
612
+ - `multi_dataset_batch_sampler`: proportional
613
+ - `router_mapping`: {}
614
+ - `learning_rate_mapping`: {}
615
+
616
+ </details>
617
+
618
+ ### Training Logs
619
+ | Epoch | Step | Training Loss | Validation Loss | CT26-dev-split_cosine_ndcg@10 |
620
+ |:------:|:----:|:-------------:|:---------------:|:-----------------------------:|
621
+ | 0.3497 | 100 | 1.412 | - | - |
622
+ | 0.6993 | 200 | 0.4583 | - | - |
623
+ | 1.0 | 286 | - | 0.3687 | 0.8100 |
624
+ | 1.0490 | 300 | 0.4552 | - | - |
625
+ | 1.3986 | 400 | 0.3449 | - | - |
626
+ | 1.7483 | 500 | 0.334 | - | - |
627
+ | 2.0 | 572 | - | 0.3241 | 0.8166 |
628
+ | 2.0979 | 600 | 0.2666 | - | - |
629
+ | 2.4476 | 700 | 0.1872 | - | - |
630
+ | 2.7972 | 800 | 0.2041 | - | - |
631
+ | 3.0 | 858 | - | 0.3176 | 0.8194 |
632
+ | 3.1469 | 900 | 0.1789 | - | - |
633
+ | 3.4965 | 1000 | 0.1246 | - | - |
634
+ | 3.8462 | 1100 | 0.1279 | - | - |
635
+ | 4.0 | 1144 | - | 0.3149 | 0.8181 |
636
+ | 4.1958 | 1200 | 0.1071 | - | - |
637
+ | 4.5455 | 1300 | 0.0869 | - | - |
638
+ | 4.8951 | 1400 | 0.0895 | - | - |
639
+ | 5.0 | 1430 | - | 0.3100 | 0.8152 |
640
+ | 5.2448 | 1500 | 0.0773 | - | - |
641
+ | 5.5944 | 1600 | 0.0726 | - | - |
642
+ | 5.9441 | 1700 | 0.0767 | - | - |
643
+ | 6.0 | 1716 | - | 0.2971 | 0.8175 |
644
+ | 6.2937 | 1800 | 0.0625 | - | - |
645
+ | 6.6434 | 1900 | 0.06 | - | - |
646
+ | 6.9930 | 2000 | 0.0667 | - | - |
647
+ | 7.0 | 2002 | - | 0.2981 | 0.8210 |
648
+ | 7.3427 | 2100 | 0.0609 | - | - |
649
+ | 7.6923 | 2200 | 0.0549 | - | - |
650
+ | 8.0 | 2288 | - | 0.3009 | 0.8222 |
651
+ | 8.0420 | 2300 | 0.0503 | - | - |
652
+ | 8.3916 | 2400 | 0.0487 | - | - |
653
+ | 8.7413 | 2500 | 0.0498 | - | - |
654
+ | 9.0 | 2574 | - | 0.3020 | 0.8210 |
655
+ | 9.0909 | 2600 | 0.0456 | - | - |
656
+ | 9.4406 | 2700 | 0.0496 | - | - |
657
+ | 9.7902 | 2800 | 0.0521 | - | - |
658
+ | 10.0 | 2860 | - | 0.2993 | 0.8180 |
659
+
660
+
661
+ ### Training Time
662
+ - **Training**: 21.5 minutes
663
+
664
+ ### Framework Versions
665
+ - Python: 3.12.6
666
+ - Sentence Transformers: 5.4.1
667
+ - Transformers: 4.56.0
668
+ - PyTorch: 2.8.0+cu129
669
+ - Accelerate: 1.10.1
670
+ - Datasets: 4.8.5
671
+ - Tokenizers: 0.22.0
672
+
673
+ ## Citation
674
+
675
+ ### BibTeX
676
+
677
+ #### Sentence Transformers
678
+ ```bibtex
679
+ @inproceedings{reimers-2019-sentence-bert,
680
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
681
+ author = "Reimers, Nils and Gurevych, Iryna",
682
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
683
+ month = "11",
684
+ year = "2019",
685
+ publisher = "Association for Computational Linguistics",
686
+ url = "https://arxiv.org/abs/1908.10084",
687
+ }
688
+ ```
689
+
690
+ #### MultipleNegativesRankingLoss
691
+ ```bibtex
692
+ @misc{oord2019representationlearningcontrastivepredictive,
693
+ title={Representation Learning with Contrastive Predictive Coding},
694
+ author={Aaron van den Oord and Yazhe Li and Oriol Vinyals},
695
+ year={2019},
696
+ eprint={1807.03748},
697
+ archivePrefix={arXiv},
698
+ primaryClass={cs.LG},
699
+ url={https://arxiv.org/abs/1807.03748},
700
+ }
701
+ ```
702
+
703
+ <!--
704
+ ## Glossary
705
+
706
+ *Clearly define terms in order to be accessible across audiences.*
707
+ -->
708
+
709
+ <!--
710
+ ## Model Card Authors
711
+
712
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
713
+ -->
714
+
715
+ <!--
716
+ ## Model Card Contact
717
+
718
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
719
+ -->
config.json ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "XLMRobertaModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "bos_token_id": 0,
7
+ "classifier_dropout": null,
8
+ "dtype": "float32",
9
+ "eos_token_id": 2,
10
+ "hidden_act": "gelu",
11
+ "hidden_dropout_prob": 0.1,
12
+ "hidden_size": 1024,
13
+ "initializer_range": 0.02,
14
+ "intermediate_size": 4096,
15
+ "layer_norm_eps": 1e-05,
16
+ "max_position_embeddings": 514,
17
+ "model_type": "xlm-roberta",
18
+ "num_attention_heads": 16,
19
+ "num_hidden_layers": 24,
20
+ "output_past": true,
21
+ "pad_token_id": 1,
22
+ "position_embedding_type": "absolute",
23
+ "transformers_version": "4.56.0",
24
+ "type_vocab_size": 1,
25
+ "use_cache": true,
26
+ "vocab_size": 250002
27
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "pytorch": "2.8.0+cu129",
4
+ "sentence_transformers": "5.4.1",
5
+ "transformers": "4.56.0"
6
+ },
7
+ "default_prompt_name": null,
8
+ "model_type": "SentenceTransformer",
9
+ "prompts": {
10
+ "document": "",
11
+ "query": ""
12
+ },
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d736a1a9c76b3c4429d080ed08357ec85e223db19631d165fefa87e61d879cef
3
+ size 2239607176
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.base.modules.transformer.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.sentence_transformer.modules.pooling.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.sentence_transformer.modules.normalize.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "transformer_task": "feature-extraction",
3
+ "modality_config": {
4
+ "text": {
5
+ "method": "forward",
6
+ "method_output_name": "last_hidden_state"
7
+ }
8
+ },
9
+ "module_output_name": "token_embeddings"
10
+ }
sentencepiece.bpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cfc8146abe2a0488e9e2a0c56de7952f7c11ab059eca145a0a727afce0db2865
3
+ size 5069051
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a514807cffabd8abaf028cfaffe7ff0c4f60b97ea2db80c41f14172ae6b018ca
3
+ size 17082987
tokenizer_config.json ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "250001": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "additional_special_tokens": [],
45
+ "bos_token": "<s>",
46
+ "clean_up_tokenization_spaces": true,
47
+ "cls_token": "<s>",
48
+ "eos_token": "</s>",
49
+ "extra_special_tokens": {},
50
+ "mask_token": "<mask>",
51
+ "model_max_length": 256,
52
+ "pad_token": "<pad>",
53
+ "sep_token": "</s>",
54
+ "tokenizer_class": "XLMRobertaTokenizer",
55
+ "unk_token": "<unk>"
56
+ }