Sentence Similarity
sentence-transformers
Safetensors
xlm-roberta
feature-extraction
Generated from Trainer
dataset_size:4858
loss:MultipleNegativesRankingLoss
text-embeddings-inference
Instructions to use Sathvik0101/srag-biencoder-hn with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use Sathvik0101/srag-biencoder-hn with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("Sathvik0101/srag-biencoder-hn") sentences = [ "My partner has made a serious mistake that has deeply hurt our relationship. I feel immense anger and betrayal, but also a deep love. I'm caught between forgiving them to save the relationship or protecting myself by walking away, and I don't know which choice will bring me peace.", "hṛṣīkeśaṃ tadā vākyam idam āha mahīpate | senayor ubhayor madhye rathaṃ sthāpaya me 'cyuta ||21|| yāvad etān nirīkṣe 'haṃ yoddhukāmān avasthitān | kair mayā saha yoddhavyam asmin raṇasamudyame ||22|| yotsyamānān avekṣe 'haṃ ya ete 'tra samāgatāḥ | dhārtarāṣṭrasya durbuddher yuddhe priyacikīrṣavaḥ ||23||", "na caitad vidmaḥ kataran no garīyo yad vā jayema yadi vā no jayeyuḥ | yān eva hatvā na jijīviṣāmas te 'vasthitāḥ pramukhe dhārtarāṣṭrāḥ ||6||", "suhṛn-mitrāry-udāsīna-madhyastha-dveṣya-bandhuṣu | sādhuṣv api ca pāpeṣu sama-buddhir viśiṣyate ||9||", "samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||", "kārpaṇya-doṣopahata-svabhāvaḥ pṛcchāmi tvāṃ dharma-saṃmūḍha-cetāḥ | yac chreyaḥ syān niścitaṃ brūhi tan me śiṣyas te 'haṃ śādhi māṃ tvāṃ prapannam ||7||", "aniṣṭam iṣṭaṃ miśraṃ ca trividhaṃ karmaṇaḥ phalam | bhavaty atyāgināṃ pretya na tu saṃnyāsināṃ kvacit ||12||", "samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||", "nirmāna-mohā jita-saṅga-doṣā adhyātma-nityā vinivṛtta-kāmāḥ | dvandvair vimuktāḥ sukha-duḥkha-saṃjñair gacchanty amūḍhāḥ padam avyayaṃ tat ||5||" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [9, 9] - Notebooks
- Google Colab
- Kaggle
metadata
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:4858
- loss:MultipleNegativesRankingLoss
base_model: sanganaka/bge-m3-sanskritFT
widget:
- source_sentence: >-
My partner has made a serious mistake that has deeply hurt our
relationship. I feel immense anger and betrayal, but also a deep love. I'm
caught between forgiving them to save the relationship or protecting
myself by walking away, and I don't know which choice will bring me peace.
sentences:
- >-
hṛṣīkeśaṃ tadā vākyam idam āha mahīpate | senayor ubhayor madhye rathaṃ
sthāpaya me 'cyuta ||21|| yāvad etān nirīkṣe 'haṃ yoddhukāmān avasthitān
| kair mayā saha yoddhavyam asmin raṇasamudyame ||22|| yotsyamānān
avekṣe 'haṃ ya ete 'tra samāgatāḥ | dhārtarāṣṭrasya durbuddher yuddhe
priyacikīrṣavaḥ ||23||
- >-
na caitad vidmaḥ kataran no garīyo yad vā jayema yadi vā no jayeyuḥ |
yān eva hatvā na jijīviṣāmas te 'vasthitāḥ pramukhe dhārtarāṣṭrāḥ ||6||
- >-
suhṛn-mitrāry-udāsīna-madhyastha-dveṣya-bandhuṣu | sādhuṣv api ca pāpeṣu
sama-buddhir viśiṣyate ||9||
- >-
samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu
samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena
kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||
- >-
kārpaṇya-doṣopahata-svabhāvaḥ pṛcchāmi tvāṃ dharma-saṃmūḍha-cetāḥ | yac
chreyaḥ syān niścitaṃ brūhi tan me śiṣyas te 'haṃ śādhi māṃ tvāṃ
prapannam ||7||
- >-
aniṣṭam iṣṭaṃ miśraṃ ca trividhaṃ karmaṇaḥ phalam | bhavaty atyāgināṃ
pretya na tu saṃnyāsināṃ kvacit ||12||
- >-
samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu
samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena
kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||
- >-
nirmāna-mohā jita-saṅga-doṣā adhyātma-nityā vinivṛtta-kāmāḥ | dvandvair
vimuktāḥ sukha-duḥkha-saṃjñair gacchanty amūḍhāḥ padam avyayaṃ tat ||5||
- source_sentence: >-
I’ve accumulated so many possessions, a big house, fancy cars, but I still
feel empty and unfulfilled. Why does nothing seem to bring lasting joy?
sentences:
- >-
vīta-rāga-bhaya-krodhā man-mayā mām upāśritāḥ | bahavo jñāna-tapasā pūtā
mad-bhāvam āgatāḥ ||10||
- >-
mahātmānas tu māṃ pārtha daivīṃ prakṛtim āśritāḥ | bhajanty
ananya-manaso jñātvā bhūtādim avyayam ||13||
- >-
yadṛcchā-lābha-santuṣṭo dvandvātīto vimatsaraḥ | samaḥ siddhāv asiddhau
ca kṛtvāpi na nibadhyate ||22||
- >-
aneka-bāhūdara-vaktra-netraṃ paśyāmi tvā sarvato 'nanta-rūpam | nāntaṃ
na madhyaṃ na punas tavādiṃ paśyāmi viśveśvara viśvarūpa ||16||
- >-
sarva-karmāṇi manasā saṃnyasyāste sukhaṃ vaśī | nava-dvāre pure dehī
naiva kurvan na kārayan ||13||
- >-
amī hi tvā sura-saṃghā viśanti kecid bhītāḥ prāñjalayo gṛṇanti |
svastīty uktvā maharṣi-siddha-saṃghāḥ stuvanti tvāṃ stutibhiḥ
puṣkalābhiḥ ||21||
- >-
mām upetya punar janma duḥkhālayam aśāśvatam | nāpnuvanti mahātmānaḥ
saṃsiddhiṃ paramāṃ gatāḥ ||15||
- >-
paras tasmāt tu bhāvo 'nyo 'vyakto 'vyaktāt sanātanaḥ | yaḥ sa sarveṣu
bhūteṣu naśyatsu na vinaśyati ||20||
- source_sentence: >-
I've lost someone incredibly dear to me, and the pain is unbearable. I
feel like a part of me is gone forever. How can I heal and find meaning
amidst this sorrow?
sentences:
- >-
avyakto 'kṣara ity uktas tam āhuḥ paramāṃ gatim | yaṃ prāpya na
nivartante tad dhāma paramaṃ mama ||21||
- >-
jitātmanaḥ praśāntasya paramātmā samāhitaḥ | śītoṣṇa-sukha-duḥkheṣu
tathā mānāpamānayoḥ ||7||
- |-
ye tu sarvāṇi karmāṇi mayi saṃnyasya matparaḥ |
ananyenaiva yogena māṃ dhyāyanta upāsate |
- >-
hṛṣīkeśaṃ tadā vākyam idam āha mahīpate | senayor ubhayor madhye rathaṃ
sthāpaya me 'cyuta ||21|| yāvad etān nirīkṣe 'haṃ yoddhukāmān avasthitān
| kair mayā saha yoddhavyam asmin raṇasamudyame ||22|| yotsyamānān
avekṣe 'haṃ ya ete 'tra samāgatāḥ | dhārtarāṣṭrasya durbuddher yuddhe
priyacikīrṣavaḥ ||23||
- >-
samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu
samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena
kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||
- >-
mām upetya punar janma duḥkhālayam aśāśvatam | nāpnuvanti mahātmānaḥ
saṃsiddhiṃ paramāṃ gatāḥ ||15||
- >-
yasmān nodvijate loko lokān nodvijate ca yaḥ | harṣāmarṣa-bhayodvegair
mukto yaḥ sa ca me priyaḥ ||15||
- >-
āyuḥ-sattva-balārogya-sukha-prīti-vivardhanāḥ | rasyāḥ snigdhāḥ sthirā
hṛdyā āhārāḥ sāttvika-priyāḥ ||8||
- source_sentence: >-
I've invested so much time and energy into a project at work, only for it
to be unexpectedly cancelled. I feel so frustrated and defeated, like all
my effort was pointless. How do I cope with this sense of futility?
sentences:
- >-
athaitad apy aśakto 'si kartuṃ mad-yogam āśritaḥ |
sarva-karma-phala-tyāgaṃ tataḥ kuru yatātmavān ||11||
- >-
nirmāna-mohā jita-saṅga-doṣā adhyātma-nityā vinivṛtta-kāmāḥ | dvandvair
vimuktāḥ sukha-duḥkha-saṃjñair gacchanty amūḍhāḥ padam avyayaṃ tat ||5||
- >-
yadā viniyataṃ cittam ātmany evāvatiṣṭhate | niḥspṛhaḥ sarva-kāmebhyo
yukta ity ucyate tadā ||18||
- >-
naiva kiṃ cit karomīti yukto manyeta tattva-vit | paśyañ śṛṇvan spṛśañ
jighrann aśnan gacchan svapañ śvasan ||8|| pralapan visṛjan gṛhṇann
unmiṣan nimiṣann api | indriyāṇīndriyārtheṣu vartanta iti dhārayan ||9||
- >-
mayādhyakṣeṇa prakṛtiḥ sūyate sa-carācaram | hetunānena kaunteya jagad
viparivartate ||10||
- >-
śucau deśe pratiṣṭhāpya sthiram āsanam ātmanaḥ | nātyucchritaṃ nātinīcaṃ
cailājinakuśottaram ||11|| tatraikāgraṃ manaḥ kṛtvā
yata-cittendriya-kriyaḥ | upaviśyāsane yuñjyād yogam ātma-viśuddhaye
||12||
- >-
prayāṇa-kāle manasācalena bhaktyā yukto yoga-balena caiva | bhruvor
madhye prāṇam āveśya samyak sa taṃ paraṃ puruṣam upaiti divyam ||10||
- >-
kāryam ity eva yat karma niyataṃ kriyaterjuna | saṅgaṃ tyaktvā phalaṃ
caiva sa tyāgaḥ sāttviko mataḥ ||9||
- source_sentence: >-
I've recently suffered a great loss, and I feel abandoned and questioning
why such pain exists if there's a benevolent force. Where is the solace in
such suffering?
sentences:
- >-
divi sūrya-sahasrasya bhaved yugapad utthitā | yadi bhāḥ sadṛśī sā syād
bhāsas tasya mahātmanaḥ ||12||
- >-
arjuna uvāca saṃnyāsasya mahābāho tattvam icchāmi veditum | tyāgasya ca
hṛṣīkeśa pṛthak keśiniṣūdana ||1||
- >-
paritrāṇāya sādhūnāṃ vināśāya ca duṣkṛtām | dharma-saṃsthāpanārthāya
saṃbhavāmi yuge yuge ||8||
- >-
paśyaitāṃ pāṇḍuputrāṇām ācārya mahatīṃ camūm | vyūḍhāṃ drupadaputreṇa
tava śiṣyeṇa dhīmatā ||3||
- >-
tataḥ śvetair hayair yukte mahati syandane sthitau | mādhavaḥ pāṇḍavaś
caiva divyau śaṅkhau pradadhmatuḥ ||14||
- >-
saṃnyāsas tu mahābāho duḥkham āptum ayogataḥ | yoga-yukto munir brahma
nacireṇādhigacchati ||6||
- >-
mahā-bhūtāny ahaṃkāro buddhir avyaktam eva ca | indriyāṇi daśaikaṃ ca
pañca cendriya-gocarāḥ ||5|| icchā dveṣaḥ sukhaṃ duḥkhaṃ saṃghātaś
cetanā dhṛtiḥ | etat kṣetraṃ samāsena sa-vikāram udāhṛtam ||6||
- >-
avyakto 'kṣara ity uktas tam āhuḥ paramāṃ gatim | yaṃ prāpya na
nivartante tad dhāma paramaṃ mama ||21||
pipeline_tag: sentence-similarity
library_name: sentence-transformers
SentenceTransformer based on sanganaka/bge-m3-sanskritFT
This is a sentence-transformers model finetuned from sanganaka/bge-m3-sanskritFT. It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for retrieval.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: sanganaka/bge-m3-sanskritFT
- Maximum Sequence Length: 256 tokens
- Output Dimensionality: 1024 dimensions
- Similarity Function: Cosine Similarity
- Supported Modality: Text
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'transformer_task': 'feature-extraction', 'modality_config': {'text': {'method': 'forward', 'method_output_name': 'last_hidden_state'}}, 'module_output_name': 'token_embeddings', 'architecture': 'XLMRobertaModel'})
(1): Pooling({'embedding_dimension': 1024, 'pooling_mode': 'cls', 'include_prompt': True})
(2): Normalize({})
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
"I've recently suffered a great loss, and I feel abandoned and questioning why such pain exists if there's a benevolent force. Where is the solace in such suffering?",
'paritrāṇāya sādhūnāṃ vināśāya ca duṣkṛtām | dharma-saṃsthāpanārthāya saṃbhavāmi yuge yuge ||8||',
'tataḥ śvetair hayair yukte mahati syandane sthitau | mādhavaḥ pāṇḍavaś caiva divyau śaṅkhau pradadhmatuḥ ||14||',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 1.0000, 1.0000],
# [1.0000, 1.0000, 1.0000],
# [1.0000, 1.0000, 1.0000]])
Training Details
Training Dataset
Unnamed Dataset
- Size: 4,858 training samples
- Columns:
sentence_0,sentence_1,sentence_2,sentence_3,sentence_4,sentence_5,sentence_6,sentence_7, andsentence_8 - Approximate statistics based on the first 100 samples:
sentence_0 sentence_1 sentence_2 sentence_3 sentence_4 sentence_5 sentence_6 sentence_7 sentence_8 type string string string string string string string string string modality text text text text text text text text text details - min: 25 tokens
- mean: 45.6 tokens
- max: 76 tokens
- min: 37 tokens
- mean: 66.34 tokens
- max: 256 tokens
- min: 41 tokens
- mean: 64.54 tokens
- max: 242 tokens
- min: 34 tokens
- mean: 58.37 tokens
- max: 242 tokens
- min: 34 tokens
- mean: 60.14 tokens
- max: 242 tokens
- min: 34 tokens
- mean: 63.36 tokens
- max: 165 tokens
- min: 39 tokens
- mean: 63.16 tokens
- max: 256 tokens
- min: 34 tokens
- mean: 67.22 tokens
- max: 242 tokens
- min: 37 tokens
- mean: 60.1 tokens
- max: 242 tokens
- Samples:
sentence_0 sentence_1 sentence_2 sentence_3 sentence_4 sentence_5 sentence_6 sentence_7 sentence_8 I'm constantly anxious about what the future holds—will I succeed, will my loved ones be okay, will things fall apart? I can't seem to just live in the present.ā brahma-bhuvanāl lokāḥ punar-āvartino 'rjuna | mām upetya tu kaunteya punar-janma na vidyate ||16||manuṣyāṇāṃ sahasreṣu kaś cid yatati siddhaye | yatatām api siddhānāṃ kaś cin māṃ vetti tattvataḥ ||3||tasmāt sarveṣu kāleṣu mām anusmara yudhya ca | mayy arpitamanobuddhir mām evaiṣyasy asaṃśayaḥ ||7||samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||na hi prapaśyāmi mamāpanudyād yac chokam ucchoṣaṇam indriyāṇām | avāpya bhūmāv asapatnam ṛddhaṃ rājyaṃ surāṇām api cādhipatyam ||8||anudvega-karaṃ vākyaṃ satyaṃ priya-hitaṃ ca yat | svādhyāyābhyasanaṃ caiva vāṅ-mayaṃ tapa ucyate ||15||traividyā māṃ somapāḥ pūta-pāpā yajñair iṣṭvā svar-gatiṃ prārthayante | te puṇyam āsādya surendra-lokam aśnanti divyān divi deva-bhogān ||20||āyuḥ-sattva-balārogya-sukha-prīti-vivardhanāḥ | rasyāḥ snigdhāḥ sthirā hṛdyā āhārāḥ sāttvika-priyāḥ ||8||I feel so much anger towards certain people who have wronged me or situations that have gone completely against my desires. It feels like they or life itself is conspiring against me. Why does this happen, and how can I let go of this rage?na kartṛtvaṃ na karmāṇi lokasya sṛjati prabhuḥ | na karma-phala-saṃyogaṃ svabhāvas tu pravartate ||14||ye yathā māṃ prapadyante tāṃs tathaiva bhajāmy aham | mama vartmānuvartante manuṣyāḥ pārtha sarvaśaḥ ||11||kleśo 'dhikataras teṣām avyaktāsakta-cetasām | avyaktā hi gatir duḥkhaṃ dehavadbhir avāpyate ||5||manuṣyāṇāṃ sahasreṣu kaś cid yatati siddhaye | yatatām api siddhānāṃ kaś cin māṃ vetti tattvataḥ ||3||yadā yadā hi dharmasya glānir bhavati bhārata | abhyutthānam adharmasya tadātmānaṃ sṛjāmy aham ||7||mac-cittā mad-gata-prāṇā bodhayantaḥ parasparam | kathayantaś ca māṃ nityaṃ tuṣyanti ca ramanti ca ||9||aśāstra-vihitaṃ ghoraṃ tapyante ye tapo janāḥ | dambhāhaṃkāra-saṃyuktāḥ kāma-rāga-balānvitāḥ ||5|| karśayantaḥ śarīra-sthaṃ bhūta-grāmam acetasaḥ | māṃ caivāntaḥ-śarīra-sthaṃ tān viddhy āsura-niścayān ||6||vedāvināśinaṃ nityaṃ ya enam ajam avyayam | kathaṃ sa puruṣaḥ pārtha kaṃ ghātayati hanti kam ||21||I've lost someone incredibly dear to me, and the pain is unbearable. I feel like a part of me is gone forever. How can I heal and find meaning amidst this sorrow?ye tu sarvāṇi karmāṇi mayi saṃnyasya matparaḥ |
ananyenaiva yogena māṃ dhyāyanta upāsate |āyuḥ-sattva-balārogya-sukha-prīti-vivardhanāḥ | rasyāḥ snigdhāḥ sthirā hṛdyā āhārāḥ sāttvika-priyāḥ ||8||samaḥ śatrau ca mitre ca tathā mānāpamānayoḥ | śītoṣṇa-sukha-duḥkheṣu samaḥ saṅga-vivarjitaḥ ||18|| tulya-nindā-stutir maunī saṃtuṣṭo yena kenacit | aniketaḥ sthira-matir bhaktimān me priyo naraḥ ||19||avyakto 'kṣara ity uktas tam āhuḥ paramāṃ gatim | yaṃ prāpya na nivartante tad dhāma paramaṃ mama ||21||hṛṣīkeśaṃ tadā vākyam idam āha mahīpate | senayor ubhayor madhye rathaṃ sthāpaya me 'cyuta ||21|| yāvad etān nirīkṣe 'haṃ yoddhukāmān avasthitān | kair mayā saha yoddhavyam asmin raṇasamudyame ||22|| yotsyamānān avekṣe 'haṃ ya ete 'tra samāgatāḥ | dhārtarāṣṭrasya durbuddher yuddhe priyacikīrṣavaḥ ||23||yasmān nodvijate loko lokān nodvijate ca yaḥ | harṣāmarṣa-bhayodvegair mukto yaḥ sa ca me priyaḥ ||15||mām upetya punar janma duḥkhālayam aśāśvatam | nāpnuvanti mahātmānaḥ saṃsiddhiṃ paramāṃ gatāḥ ||15||jitātmanaḥ praśāntasya paramātmā samāhitaḥ | śītoṣṇa-sukha-duḥkheṣu tathā mānāpamānayoḥ ||7|| - Loss:
MultipleNegativesRankingLosswith these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim", "gather_across_devices": false, "directions": [ "query_to_doc" ], "partition_mode": "joint", "hardness_mode": null, "hardness_strength": 0.0 }
Training Hyperparameters
Non-Default Hyperparameters
per_device_train_batch_size: 4num_train_epochs: 2per_device_eval_batch_size: 4multi_dataset_batch_sampler: round_robin
All Hyperparameters
Click to expand
per_device_train_batch_size: 4num_train_epochs: 2max_steps: -1learning_rate: 5e-05lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_steps: 0optim: adamw_torch_fusedoptim_args: Noneweight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08optim_target_modules: Nonegradient_accumulation_steps: 1average_tokens_across_devices: Truemax_grad_norm: 1label_smoothing_factor: 0.0bf16: Falsefp16: Falsebf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Nonetorch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneuse_liger_kernel: Falseliger_kernel_config: Noneuse_cache: Falseneftune_noise_alpha: Nonetorch_empty_cache_steps: Noneauto_find_batch_size: Falselog_on_each_node: Truelogging_nan_inf_filter: Trueinclude_num_input_tokens_seen: nolog_level: passivelog_level_replica: warningdisable_tqdm: Falseproject: huggingfacetrackio_space_id: Nonetrackio_bucket_id: Nonetrackio_static_space_id: Noneper_device_eval_batch_size: 4prediction_loss_only: Trueeval_on_start: Falseeval_do_concat_batches: Trueeval_use_gather_object: Falseeval_accumulation_steps: Noneinclude_for_metrics: []batch_eval_metrics: Falsesave_only_model: Falsesave_on_each_node: Falseenable_jit_checkpoint: Falsepush_to_hub: Falsehub_private_repo: Nonehub_model_id: Nonehub_strategy: every_savehub_always_push: Falsehub_revision: Noneload_best_model_at_end: Falseignore_data_skip: Falserestore_callback_states_from_checkpoint: Falsefull_determinism: Falseseed: 42data_seed: Noneuse_cpu: Falseaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedataloader_drop_last: Falsedataloader_num_workers: 0dataloader_pin_memory: Truedataloader_persistent_workers: Falsedataloader_prefetch_factor: Noneremove_unused_columns: Truelabel_names: Nonetrain_sampling_strategy: randomlength_column_name: lengthddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falseddp_static_graph: Noneddp_backend: Noneddp_timeout: 1800fsdp: Nonefsdp_config: Nonedeepspeed: Nonedebug: []skip_memory_metrics: Truedo_predict: Falseresume_from_checkpoint: Nonewarmup_ratio: Nonelocal_rank: -1prompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: round_robinrouter_mapping: {}learning_rate_mapping: {}
Training Logs
| Epoch | Step | Training Loss |
|---|---|---|
| 0.4115 | 500 | 3.6841 |
| 0.8230 | 1000 | 3.5072 |
| 1.2346 | 1500 | 3.4757 |
| 1.6461 | 2000 | 3.4740 |
Training Time
- Training: 25.7 minutes
Framework Versions
- Python: 3.11.12
- Sentence Transformers: 5.5.1
- Transformers: 5.12.1
- PyTorch: 2.12.0+cu130
- Accelerate: 1.14.0
- Datasets: 5.0.0
- Tokenizers: 0.22.2
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MultipleNegativesRankingLoss
@misc{oord2019representationlearningcontrastivepredictive,
title={Representation Learning with Contrastive Predictive Coding},
author={Aaron van den Oord and Yazhe Li and Oriol Vinyals},
year={2019},
eprint={1807.03748},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/1807.03748},
}