Text Ranking
sentence-transformers
Safetensors
xlm-roberta
cross-encoder
reranker
Generated from Trainer
dataset_size:126
loss:BinaryCrossEntropyLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1 with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
Production-ready merged model - LoRA fine-tuned on rich Indian entity resolution dataset (211 records)
94ddeff verified metadata
tags:
- sentence-transformers
- cross-encoder
- reranker
- generated_from_trainer
- dataset_size:126
- loss:BinaryCrossEntropyLoss
base_model: BAAI/bge-reranker-v2-m3
pipeline_tag: text-ranking
library_name: sentence-transformers
metrics:
- accuracy
- accuracy_threshold
- f1
- f1_threshold
- precision
- recall
- average_precision
model-index:
- name: CrossEncoder based on BAAI/bge-reranker-v2-m3
results:
- task:
type: cross-encoder-classification
name: Cross Encoder Classification
dataset:
name: rich entity matching lora
type: rich-entity-matching-lora
metrics:
- type: accuracy
value: 1
name: Accuracy
- type: accuracy_threshold
value: 0.49942895770072937
name: Accuracy Threshold
- type: f1
value: 1
name: F1
- type: f1_threshold
value: 0.49942895770072937
name: F1 Threshold
- type: precision
value: 1
name: Precision
- type: recall
value: 1
name: Recall
- type: average_precision
value: 1
name: Average Precision
CrossEncoder based on BAAI/bge-reranker-v2-m3
This is a Cross Encoder model finetuned from BAAI/bge-reranker-v2-m3 using the sentence-transformers library. It computes scores for pairs of texts, which can be used for text reranking and semantic search.
Model Details
Model Description
- Model Type: Cross Encoder
- Base model: BAAI/bge-reranker-v2-m3
- Maximum Sequence Length: 512 tokens
- Number of Output Labels: 1 label
Model Sources
- Documentation: Sentence Transformers Documentation
- Documentation: Cross Encoder Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Cross Encoders on Hugging Face
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import CrossEncoder
# Download from the 🤗 Hub
model = CrossEncoder("pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1")
# Get scores for pairs of texts
pairs = [
['Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8641615633 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@gmail.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@yahoo.com | Email1: | Email2: | Email3: | Email4: '],
['Name: Priya Prasad Reddy | First: Priya | Middle: Prasad | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APT, 24 MAIN RD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APT, 24 MAIN RD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9149203558 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: ', 'Name: Priya Reddy | First: Priya | Middle: | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9208449460 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: '],
['Name: Aditya Singh | First: Aditya | Middle: | Last: Singh | Gender: M | DOB: 1982-02-07 | Spouse: | Mother: | Father: | Company: CAPGEMINI INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 753 SECTOR 12 NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641099 | Address1: 753 SECTOR 12 NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641099 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: adityasingh@company.in | Email1: | Email2: | Email3: | Email4: ', 'Name: Kiran Chopra | First: Kiran | Middle: | Last: Chopra | Gender: M | DOB: 1986-04-13 | Spouse: | Mother: | Father: | Company: GOOGLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 524 MG ROAD NEAR SCHOOL | City: NAGPUR | State: MAHARASHTRA | ZIP: 440013 | Address1: 524 MG ROAD NEAR SCHOOL | City1: NAGPUR | State1: MAHARASHTRA | ZIP1: 440013 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: kiranchopra@company.in | Email1: | Email2: | Email3: | Email4: '],
['Name: Divya Patel | First: Divya | Middle: | Last: Patel | Gender: F | DOB: 1985-02-01 | Spouse: | Mother: | Father: | Company: INFOSYS TECHNOLOGIES | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 316 SECTOR 12 NEAR PARK | City: HYDERABAD | State: TELANGANA | ZIP: 500048 | Address1: 316 SECTOR 12 NEAR PARK | City1: HYDERABAD | State1: TELANGANA | ZIP1: 500048 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8883181644 | Phone1: | Phone2: | Phone3: | Phone4: | Email: divyapatel@outlook.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Nitin Rao | First: Nitin | Middle: | Last: Rao | Gender: M | DOB: 1987-12-02 | Spouse: | Mother: | Father: | Company: RELIANCE JIO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 612 BANJARA HILLS NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641039 | Address1: 612 BANJARA HILLS NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641039 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9358550137 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nitinrao@gmail.com | Email1: | Email2: | Email3: | Email4: '],
['Name: Neha Gupta | First: Neha | Middle: | Last: Gupta | Gender: F | DOB: 1981-04-13 | Spouse: | Mother: | Father: | Company: HEALTHCARE SYSTEMS | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: HOUSE 57, BLOCK A kanpur | City: KANPUR | State: UTTAR PRADESH | ZIP: 208001 | Address1: HOUSE 57, BLOCK A kanpur | City1: KANPUR | State1: UTTAR PRADESH | ZIP1: 208001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9423646570 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nehagupta@gmail.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Vikram Gupta | First: Vikram | Middle: | Last: Gupta | Gender: M | DOB: 1993-06-04 | Spouse: | Mother: | Father: | Company: TECH SOLUTIONS PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: PLOT 510, NEAR TEMPLE delhi | City: DELHI | State: DELHI | ZIP: 110001 | Address1: PLOT 510, NEAR TEMPLE delhi | City1: DELHI | State1: DELHI | ZIP1: 110001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9288650860 | Phone1: | Phone2: | Phone3: | Phone4: | Email: vikramgupta@yahoo.com | Email1: | Email2: | Email3: | Email4: '],
]
scores = model.predict(pairs)
print(scores.shape)
# (5,)
# Or rank different texts based on similarity to a single text
ranks = model.rank(
'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8641615633 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@gmail.com | Email1: | Email2: | Email3: | Email4: ',
[
'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@yahoo.com | Email1: | Email2: | Email3: | Email4: ',
'Name: Priya Reddy | First: Priya | Middle: | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9208449460 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: ',
'Name: Kiran Chopra | First: Kiran | Middle: | Last: Chopra | Gender: M | DOB: 1986-04-13 | Spouse: | Mother: | Father: | Company: GOOGLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 524 MG ROAD NEAR SCHOOL | City: NAGPUR | State: MAHARASHTRA | ZIP: 440013 | Address1: 524 MG ROAD NEAR SCHOOL | City1: NAGPUR | State1: MAHARASHTRA | ZIP1: 440013 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: kiranchopra@company.in | Email1: | Email2: | Email3: | Email4: ',
'Name: Nitin Rao | First: Nitin | Middle: | Last: Rao | Gender: M | DOB: 1987-12-02 | Spouse: | Mother: | Father: | Company: RELIANCE JIO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 612 BANJARA HILLS NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641039 | Address1: 612 BANJARA HILLS NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641039 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9358550137 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nitinrao@gmail.com | Email1: | Email2: | Email3: | Email4: ',
'Name: Vikram Gupta | First: Vikram | Middle: | Last: Gupta | Gender: M | DOB: 1993-06-04 | Spouse: | Mother: | Father: | Company: TECH SOLUTIONS PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: PLOT 510, NEAR TEMPLE delhi | City: DELHI | State: DELHI | ZIP: 110001 | Address1: PLOT 510, NEAR TEMPLE delhi | City1: DELHI | State1: DELHI | ZIP1: 110001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9288650860 | Phone1: | Phone2: | Phone3: | Phone4: | Email: vikramgupta@yahoo.com | Email1: | Email2: | Email3: | Email4: ',
]
)
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]
Evaluation
Metrics
Cross Encoder Classification
- Dataset:
rich-entity-matching-lora - Evaluated with
CrossEncoderClassificationEvaluator
| Metric | Value |
|---|---|
| accuracy | 1.0 |
| accuracy_threshold | 0.4994 |
| f1 | 1.0 |
| f1_threshold | 0.4994 |
| precision | 1.0 |
| recall | 1.0 |
| average_precision | 1.0 |
Training Details
Training Dataset
Unnamed Dataset
- Size: 126 training samples
- Columns:
sentence1,sentence2, andlabel - Approximate statistics based on the first 126 samples:
sentence1 sentence2 label type string string int details - min: 622 characters
- mean: 660.92 characters
- max: 715 characters
- min: 627 characters
- mean: 664.96 characters
- max: 731 characters
- 0: ~53.97%
- 1: ~46.03%
- Samples:
sentence1 sentence2 label Name: Arjun Singh | First: Arjun | Middle: | Last: Singh | Gender: M | DOB: 1997-12-22 | Spouse: | Mother: | Father: | Company: RELIANCE JIO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 910 KORAMANGALA NEAR MALL | City: PUNE | State: MAHARASHTRA | ZIP: 411092 | Address1: 910 KORAMANGALA NEAR MALL | City1: PUNE | State1: MAHARASHTRA | ZIP1: 411092 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: arjunsingh@workmail.com | Email1: | Email2: | Email3: | Email4:Name: Swati Nair | First: Swati | Middle: | Last: Nair | Gender: F | DOB: 2003-05-24 | Spouse: | Mother: | Father: | Company: ZOMATO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 108 MAIN BAZAR NEAR SCHOOL | City: CHENNAI | State: TAMIL NADU | ZIP: 600078 | Address1: 108 MAIN BAZAR NEAR SCHOOL | City1: CHENNAI | State1: TAMIL NADU | ZIP1: 600078 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 7185520040 | Phone1: | Phone2: | Phone3: | Phone4: | Email: swatinair@yahoo.com | Email1: | Email2: | Email3: | Email4:0Name: Rahul Iyer | First: Rahul | Middle: | Last: Iyer | Gender: M | DOB: 1980-02-25 | Spouse: | Mother: | Father: | Company: MANUFACTURING CO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 271, PARK APT, 32 MAIN RD | City: BANGALORE | State: KARNATAKA | ZIP: 560001 | Address1: FLAT 271, PARK APT, 32 MAIN RD | City1: BANGALORE | State1: KARNATAKA | ZIP1: 560001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9162959284 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rahuliyer@gmail.com | Email1: rahul.iyer@manufacturingco.com | Email2: | Email3: | Email4:Name: Rahul Iyer | First: Rahul | Middle: | Last: Iyer | Gender: M | DOB: 1980-02-25 | Spouse: | Mother: | Father: | Company: MANUFACTURING CO PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 271, PARK APT, 32 MAIN RD | City: BANGALORE | State: KARNATAKA | ZIP: 560001 | Address1: FLAT 271, PARK APT, 32 MAIN RD | City1: BANGALORE | State1: KARNATAKA | ZIP1: 560001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9162959284 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rahuliyer@gmail.com | Email1: rahul.iyer@manufacturingco.com | Email2: | Email3: | Email4:1Name: Amit Reddy | First: Amit | Middle: | Last: Reddy | Gender: M | DOB: 1994-06-17 | Spouse: | Mother: | Father: | Company: AUTO PARTS PVT | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: HOUSE 112, SECTOR 12 kanpur | City: KANPUR | State: UTTAR PRADESH | ZIP: 208001 | Address1: HOUSE 112, SECTOR 12 kanpur | City1: KANPUR | State1: UTTAR PRADESH | ZIP1: 208001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9337887233 | Phone1: | Phone2: | Phone3: | Phone4: | Email: amitreddy@gmail.com | Email1: | Email2: | Email3: | Email4:Name: Pradeep Gupta | First: Pradeep | Middle: | Last: Gupta | Gender: M | DOB: 2004-11-01 | Spouse: | Mother: | Father: | Company: MANUFACTURING CO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: PLOT 491, NEAR TEMPLE delhi | City: DELHI | State: DELHI | ZIP: 110001 | Address1: PLOT 491, NEAR TEMPLE delhi | City1: DELHI | State1: DELHI | ZIP1: 110001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9836691577 | Phone1: | Phone2: | Phone3: | Phone4: | Email: pradeepgupta@yahoo.com | Email1: | Email2: | Email3: | Email4:0 - Loss:
BinaryCrossEntropyLosswith these parameters:{ "activation_fn": "torch.nn.modules.linear.Identity", "pos_weight": null }
Evaluation Dataset
Unnamed Dataset
- Size: 32 evaluation samples
- Columns:
sentence1,sentence2, andlabel - Approximate statistics based on the first 32 samples:
sentence1 sentence2 label type string string int details - min: 620 characters
- mean: 661.31 characters
- max: 708 characters
- min: 628 characters
- mean: 660.66 characters
- max: 720 characters
- 0: ~53.12%
- 1: ~46.88%
- Samples:
sentence1 sentence2 label Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8641615633 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@gmail.com | Email1: | Email2: | Email3: | Email4:Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@yahoo.com | Email1: | Email2: | Email3: | Email4:1Name: Priya Prasad Reddy | First: Priya | Middle: Prasad | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APT, 24 MAIN RD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APT, 24 MAIN RD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9149203558 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4:Name: Priya Reddy | First: Priya | Middle: | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9208449460 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4:1Name: Aditya Singh | First: Aditya | Middle: | Last: Singh | Gender: M | DOB: 1982-02-07 | Spouse: | Mother: | Father: | Company: CAPGEMINI INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 753 SECTOR 12 NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641099 | Address1: 753 SECTOR 12 NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641099 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: adityasingh@company.in | Email1: | Email2: | Email3: | Email4:Name: Kiran Chopra | First: Kiran | Middle: | Last: Chopra | Gender: M | DOB: 1986-04-13 | Spouse: | Mother: | Father: | Company: GOOGLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 524 MG ROAD NEAR SCHOOL | City: NAGPUR | State: MAHARASHTRA | ZIP: 440013 | Address1: 524 MG ROAD NEAR SCHOOL | City1: NAGPUR | State1: MAHARASHTRA | ZIP1: 440013 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: kiranchopra@company.in | Email1: | Email2: | Email3: | Email4:0 - Loss:
BinaryCrossEntropyLosswith these parameters:{ "activation_fn": "torch.nn.modules.linear.Identity", "pos_weight": null }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: stepsper_device_train_batch_size: 4gradient_accumulation_steps: 4learning_rate: 3e-05weight_decay: 0.05warmup_ratio: 0.2warmup_steps: 0.2dataloader_num_workers: 3load_best_model_at_end: Truedataloader_pin_memory: Falsedataloader_persistent_workers: True
All Hyperparameters
Click to expand
do_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 4per_device_eval_batch_size: 8gradient_accumulation_steps: 4eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 3e-05weight_decay: 0.05adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 3max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: Nonewarmup_ratio: 0.2warmup_steps: 0.2log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Trueenable_jit_checkpoint: Falsesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseuse_cpu: Falseseed: 42data_seed: Nonebf16: Falsefp16: Falsebf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: -1ddp_backend: Nonedebug: []dataloader_drop_last: Falsedataloader_num_workers: 3dataloader_prefetch_factor: Nonedisable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Trueignore_data_skip: Falsefsdp: []fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedeepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torch_fusedoptim_args: Nonegroup_by_length: Falselength_column_name: lengthproject: huggingfacetrackio_space_id: trackioddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Falsedataloader_persistent_workers: Trueskip_memory_metrics: Truepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_for_metrics: []eval_do_concat_batches: Trueauto_find_batch_size: Falsefull_determinism: Falseddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_num_input_tokens_seen: noneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Trueuse_cache: Falseprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}
Training Logs
| Epoch | Step | Training Loss | Validation Loss | rich-entity-matching-lora_average_precision |
|---|---|---|---|---|
| 0.5 | 4 | 0.0774 | - | - |
| 1.0 | 8 | 0.0427 | - | - |
| 1.5 | 12 | 0.0678 | - | - |
| 1.875 | 15 | - | 0.0067 | 1.0 |
| 2.0 | 16 | 0.0284 | - | - |
| 2.5 | 20 | 0.0091 | - | - |
| 3.0 | 24 | 0.0860 | - | - |
Framework Versions
- Python: 3.12.12
- Sentence Transformers: 5.2.3
- Transformers: 5.0.0
- PyTorch: 2.10.0+cpu
- Accelerate: 1.12.0
- Datasets: 4.8.3
- Tokenizers: 0.22.2
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}