Sentence Similarity
sentence-transformers
PyTorch
TensorBoard
English
deberta-v2
feature-extraction
Generated from Trainer
dataset_size:123245
loss:CachedGISTEmbedLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use bobox/DeBERTa3-base-STr-CosineWaves with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use bobox/DeBERTa3-base-STr-CosineWaves with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("bobox/DeBERTa3-base-STr-CosineWaves") sentences = [ "what type of inheritance does haemochromatosis", "Nestled on the tranquil banks of the Pamlico River, Moss Landing is a vibrant new community of thoughtfully conceived, meticulously crafted single-family homes in Washington, North Carolina. Washington is renowned for its historic architecture and natural beauty.", "1 Microwave on high for 8 to 10 minutes or until tender, turning the yams once. 2 To microwave sliced yams: Wash, peel, and cut off the woody portions and ends. 3 Cut yams into quarters. 4 Place the yams and 1/2 cup water in a microwave-safe casserole.ake the Yams. 1 Place half the yams in a 1-quart casserole. 2 Layer with half the brown sugar and half the margarine. 3 Repeat the layers. 4 Bake, uncovered, in a 375 degree F oven for 30 to 35 minutes or until the yams are glazed, spooning the liquid over the yams once or twice during cooking.", "Types 1, 2, and 3 hemochromatosis are inherited in an autosomal recessive pattern, which means both copies of the gene in each cell have mutations. Most often, the parents of an individual with an autosomal recessive condition each carry one copy of the mutated gene but do not show signs and symptoms of the condition.Type 4 hemochromatosis is distinguished by its autosomal dominant inheritance pattern.With this type of inheritance, one copy of the altered gene in each cell is sufficient to cause the disorder. In most cases, an affected person has one parent with the condition.ype 1, the most common form of the disorder, and type 4 (also called ferroportin disease) begin in adulthood. Men with type 1 or type 4 hemochromatosis typically develop symptoms between the ages of 40 and 60, and women usually develop symptoms after menopause. Type 2 hemochromatosis is a juvenile-onset disorder." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
Training in progress, step 2578, checkpoint
Browse files- checkpoint-2578/1_Pooling/config.json +10 -0
- checkpoint-2578/README.md +0 -0
- checkpoint-2578/added_tokens.json +3 -0
- checkpoint-2578/config.json +35 -0
- checkpoint-2578/config_sentence_transformers.json +10 -0
- checkpoint-2578/modules.json +14 -0
- checkpoint-2578/optimizer.pt +3 -0
- checkpoint-2578/pytorch_model.bin +3 -0
- checkpoint-2578/rng_state.pth +3 -0
- checkpoint-2578/scheduler.pt +3 -0
- checkpoint-2578/sentence_bert_config.json +4 -0
- checkpoint-2578/special_tokens_map.json +15 -0
- checkpoint-2578/spm.model +3 -0
- checkpoint-2578/tokenizer.json +0 -0
- checkpoint-2578/tokenizer_config.json +58 -0
- checkpoint-2578/trainer_state.json +0 -0
- checkpoint-2578/training_args.bin +3 -0
checkpoint-2578/1_Pooling/config.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"word_embedding_dimension": 768,
|
| 3 |
+
"pooling_mode_cls_token": false,
|
| 4 |
+
"pooling_mode_mean_tokens": true,
|
| 5 |
+
"pooling_mode_max_tokens": false,
|
| 6 |
+
"pooling_mode_mean_sqrt_len_tokens": false,
|
| 7 |
+
"pooling_mode_weightedmean_tokens": false,
|
| 8 |
+
"pooling_mode_lasttoken": false,
|
| 9 |
+
"include_prompt": true
|
| 10 |
+
}
|
checkpoint-2578/README.md
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
checkpoint-2578/added_tokens.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"[MASK]": 128000
|
| 3 |
+
}
|
checkpoint-2578/config.json
ADDED
|
@@ -0,0 +1,35 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_name_or_path": "microsoft/deberta-v3-base",
|
| 3 |
+
"architectures": [
|
| 4 |
+
"DebertaV2Model"
|
| 5 |
+
],
|
| 6 |
+
"attention_probs_dropout_prob": 0.1,
|
| 7 |
+
"hidden_act": "gelu",
|
| 8 |
+
"hidden_dropout_prob": 0.1,
|
| 9 |
+
"hidden_size": 768,
|
| 10 |
+
"initializer_range": 0.02,
|
| 11 |
+
"intermediate_size": 3072,
|
| 12 |
+
"layer_norm_eps": 1e-07,
|
| 13 |
+
"max_position_embeddings": 512,
|
| 14 |
+
"max_relative_positions": -1,
|
| 15 |
+
"model_type": "deberta-v2",
|
| 16 |
+
"norm_rel_ebd": "layer_norm",
|
| 17 |
+
"num_attention_heads": 12,
|
| 18 |
+
"num_hidden_layers": 12,
|
| 19 |
+
"pad_token_id": 0,
|
| 20 |
+
"pooler_dropout": 0,
|
| 21 |
+
"pooler_hidden_act": "gelu",
|
| 22 |
+
"pooler_hidden_size": 768,
|
| 23 |
+
"pos_att_type": [
|
| 24 |
+
"p2c",
|
| 25 |
+
"c2p"
|
| 26 |
+
],
|
| 27 |
+
"position_biased_input": false,
|
| 28 |
+
"position_buckets": 256,
|
| 29 |
+
"relative_attention": true,
|
| 30 |
+
"share_att_key": true,
|
| 31 |
+
"torch_dtype": "float32",
|
| 32 |
+
"transformers_version": "4.44.0",
|
| 33 |
+
"type_vocab_size": 0,
|
| 34 |
+
"vocab_size": 128100
|
| 35 |
+
}
|
checkpoint-2578/config_sentence_transformers.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"__version__": {
|
| 3 |
+
"sentence_transformers": "3.0.1",
|
| 4 |
+
"transformers": "4.44.0",
|
| 5 |
+
"pytorch": "2.4.0"
|
| 6 |
+
},
|
| 7 |
+
"prompts": {},
|
| 8 |
+
"default_prompt_name": null,
|
| 9 |
+
"similarity_fn_name": null
|
| 10 |
+
}
|
checkpoint-2578/modules.json
ADDED
|
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
[
|
| 2 |
+
{
|
| 3 |
+
"idx": 0,
|
| 4 |
+
"name": "0",
|
| 5 |
+
"path": "",
|
| 6 |
+
"type": "sentence_transformers.models.Transformer"
|
| 7 |
+
},
|
| 8 |
+
{
|
| 9 |
+
"idx": 1,
|
| 10 |
+
"name": "1",
|
| 11 |
+
"path": "1_Pooling",
|
| 12 |
+
"type": "sentence_transformers.models.Pooling"
|
| 13 |
+
}
|
| 14 |
+
]
|
checkpoint-2578/optimizer.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5c8b040407e4b08e9de61dabbd4624bf1d1075d2fe84c58f420a21dbadbcaccb
|
| 3 |
+
size 1470818042
|
checkpoint-2578/pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:5cf02223e71307a5f4f36e0e2710d03f0ac39f39f98a15749637360dd2f976cb
|
| 3 |
+
size 735393442
|
checkpoint-2578/rng_state.pth
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a2f0a7c3774682da89d3ba66d37fcbd40a01b26a2943aea21307e2f25b5c9104
|
| 3 |
+
size 14244
|
checkpoint-2578/scheduler.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:b9dcc6600c490b89d7d5f1e4fab2af55336fe44660aa70fa2eca7e587932336f
|
| 3 |
+
size 1064
|
checkpoint-2578/sentence_bert_config.json
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"max_seq_length": 512,
|
| 3 |
+
"do_lower_case": false
|
| 4 |
+
}
|
checkpoint-2578/special_tokens_map.json
ADDED
|
@@ -0,0 +1,15 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"bos_token": "[CLS]",
|
| 3 |
+
"cls_token": "[CLS]",
|
| 4 |
+
"eos_token": "[SEP]",
|
| 5 |
+
"mask_token": "[MASK]",
|
| 6 |
+
"pad_token": "[PAD]",
|
| 7 |
+
"sep_token": "[SEP]",
|
| 8 |
+
"unk_token": {
|
| 9 |
+
"content": "[UNK]",
|
| 10 |
+
"lstrip": false,
|
| 11 |
+
"normalized": true,
|
| 12 |
+
"rstrip": false,
|
| 13 |
+
"single_word": false
|
| 14 |
+
}
|
| 15 |
+
}
|
checkpoint-2578/spm.model
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c679fbf93643d19aab7ee10c0b99e460bdbc02fedf34b92b05af343b4af586fd
|
| 3 |
+
size 2464616
|
checkpoint-2578/tokenizer.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
checkpoint-2578/tokenizer_config.json
ADDED
|
@@ -0,0 +1,58 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"added_tokens_decoder": {
|
| 3 |
+
"0": {
|
| 4 |
+
"content": "[PAD]",
|
| 5 |
+
"lstrip": false,
|
| 6 |
+
"normalized": false,
|
| 7 |
+
"rstrip": false,
|
| 8 |
+
"single_word": false,
|
| 9 |
+
"special": true
|
| 10 |
+
},
|
| 11 |
+
"1": {
|
| 12 |
+
"content": "[CLS]",
|
| 13 |
+
"lstrip": false,
|
| 14 |
+
"normalized": false,
|
| 15 |
+
"rstrip": false,
|
| 16 |
+
"single_word": false,
|
| 17 |
+
"special": true
|
| 18 |
+
},
|
| 19 |
+
"2": {
|
| 20 |
+
"content": "[SEP]",
|
| 21 |
+
"lstrip": false,
|
| 22 |
+
"normalized": false,
|
| 23 |
+
"rstrip": false,
|
| 24 |
+
"single_word": false,
|
| 25 |
+
"special": true
|
| 26 |
+
},
|
| 27 |
+
"3": {
|
| 28 |
+
"content": "[UNK]",
|
| 29 |
+
"lstrip": false,
|
| 30 |
+
"normalized": true,
|
| 31 |
+
"rstrip": false,
|
| 32 |
+
"single_word": false,
|
| 33 |
+
"special": true
|
| 34 |
+
},
|
| 35 |
+
"128000": {
|
| 36 |
+
"content": "[MASK]",
|
| 37 |
+
"lstrip": false,
|
| 38 |
+
"normalized": false,
|
| 39 |
+
"rstrip": false,
|
| 40 |
+
"single_word": false,
|
| 41 |
+
"special": true
|
| 42 |
+
}
|
| 43 |
+
},
|
| 44 |
+
"bos_token": "[CLS]",
|
| 45 |
+
"clean_up_tokenization_spaces": true,
|
| 46 |
+
"cls_token": "[CLS]",
|
| 47 |
+
"do_lower_case": false,
|
| 48 |
+
"eos_token": "[SEP]",
|
| 49 |
+
"mask_token": "[MASK]",
|
| 50 |
+
"model_max_length": 1000000000000000019884624838656,
|
| 51 |
+
"pad_token": "[PAD]",
|
| 52 |
+
"sep_token": "[SEP]",
|
| 53 |
+
"sp_model_kwargs": {},
|
| 54 |
+
"split_by_punct": false,
|
| 55 |
+
"tokenizer_class": "DebertaV2Tokenizer",
|
| 56 |
+
"unk_token": "[UNK]",
|
| 57 |
+
"vocab_type": "spm"
|
| 58 |
+
}
|
checkpoint-2578/trainer_state.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
checkpoint-2578/training_args.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bd87e469d7441c9256c28d82e79e8e3da32741d45a6bc310f3f601255edbdf8d
|
| 3 |
+
size 5752
|