Text Ranking
sentence-transformers
Safetensors
xlm-roberta
cross-encoder
reranker
Generated from Trainer
dataset_size:126
loss:BinaryCrossEntropyLoss
Eval Results (legacy)
text-embeddings-inference
Instructions to use pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1 with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Notebooks
- Google Colab
- Kaggle
Production-ready merged model - LoRA fine-tuned on rich Indian entity resolution dataset (211 records)
Browse files- .gitattributes +1 -0
- README.md +396 -0
- config.json +40 -0
- model.safetensors +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +16 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,396 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
tags:
|
| 3 |
+
- sentence-transformers
|
| 4 |
+
- cross-encoder
|
| 5 |
+
- reranker
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
- dataset_size:126
|
| 8 |
+
- loss:BinaryCrossEntropyLoss
|
| 9 |
+
base_model: BAAI/bge-reranker-v2-m3
|
| 10 |
+
pipeline_tag: text-ranking
|
| 11 |
+
library_name: sentence-transformers
|
| 12 |
+
metrics:
|
| 13 |
+
- accuracy
|
| 14 |
+
- accuracy_threshold
|
| 15 |
+
- f1
|
| 16 |
+
- f1_threshold
|
| 17 |
+
- precision
|
| 18 |
+
- recall
|
| 19 |
+
- average_precision
|
| 20 |
+
model-index:
|
| 21 |
+
- name: CrossEncoder based on BAAI/bge-reranker-v2-m3
|
| 22 |
+
results:
|
| 23 |
+
- task:
|
| 24 |
+
type: cross-encoder-classification
|
| 25 |
+
name: Cross Encoder Classification
|
| 26 |
+
dataset:
|
| 27 |
+
name: rich entity matching lora
|
| 28 |
+
type: rich-entity-matching-lora
|
| 29 |
+
metrics:
|
| 30 |
+
- type: accuracy
|
| 31 |
+
value: 1.0
|
| 32 |
+
name: Accuracy
|
| 33 |
+
- type: accuracy_threshold
|
| 34 |
+
value: 0.49942895770072937
|
| 35 |
+
name: Accuracy Threshold
|
| 36 |
+
- type: f1
|
| 37 |
+
value: 1.0
|
| 38 |
+
name: F1
|
| 39 |
+
- type: f1_threshold
|
| 40 |
+
value: 0.49942895770072937
|
| 41 |
+
name: F1 Threshold
|
| 42 |
+
- type: precision
|
| 43 |
+
value: 1.0
|
| 44 |
+
name: Precision
|
| 45 |
+
- type: recall
|
| 46 |
+
value: 1.0
|
| 47 |
+
name: Recall
|
| 48 |
+
- type: average_precision
|
| 49 |
+
value: 1.0
|
| 50 |
+
name: Average Precision
|
| 51 |
+
---
|
| 52 |
+
|
| 53 |
+
# CrossEncoder based on BAAI/bge-reranker-v2-m3
|
| 54 |
+
|
| 55 |
+
This is a [Cross Encoder](https://www.sbert.net/docs/cross_encoder/usage/usage.html) model finetuned from [BAAI/bge-reranker-v2-m3](https://huggingface.co/BAAI/bge-reranker-v2-m3) using the [sentence-transformers](https://www.SBERT.net) library. It computes scores for pairs of texts, which can be used for text reranking and semantic search.
|
| 56 |
+
|
| 57 |
+
## Model Details
|
| 58 |
+
|
| 59 |
+
### Model Description
|
| 60 |
+
- **Model Type:** Cross Encoder
|
| 61 |
+
- **Base model:** [BAAI/bge-reranker-v2-m3](https://huggingface.co/BAAI/bge-reranker-v2-m3) <!-- at revision 953dc6f6f85a1b2dbfca4c34a2796e7dde08d41e -->
|
| 62 |
+
- **Maximum Sequence Length:** 512 tokens
|
| 63 |
+
- **Number of Output Labels:** 1 label
|
| 64 |
+
<!-- - **Training Dataset:** Unknown -->
|
| 65 |
+
<!-- - **Language:** Unknown -->
|
| 66 |
+
<!-- - **License:** Unknown -->
|
| 67 |
+
|
| 68 |
+
### Model Sources
|
| 69 |
+
|
| 70 |
+
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
|
| 71 |
+
- **Documentation:** [Cross Encoder Documentation](https://www.sbert.net/docs/cross_encoder/usage/usage.html)
|
| 72 |
+
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
|
| 73 |
+
- **Hugging Face:** [Cross Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=cross-encoder)
|
| 74 |
+
|
| 75 |
+
## Usage
|
| 76 |
+
|
| 77 |
+
### Direct Usage (Sentence Transformers)
|
| 78 |
+
|
| 79 |
+
First install the Sentence Transformers library:
|
| 80 |
+
|
| 81 |
+
```bash
|
| 82 |
+
pip install -U sentence-transformers
|
| 83 |
+
```
|
| 84 |
+
|
| 85 |
+
Then you can load this model and run inference.
|
| 86 |
+
```python
|
| 87 |
+
from sentence_transformers import CrossEncoder
|
| 88 |
+
|
| 89 |
+
# Download from the 🤗 Hub
|
| 90 |
+
model = CrossEncoder("pujithapsx/finetuned_bge_reranker_m3_merged_fullrecord_v1")
|
| 91 |
+
# Get scores for pairs of texts
|
| 92 |
+
pairs = [
|
| 93 |
+
['Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8641615633 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@gmail.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@yahoo.com | Email1: | Email2: | Email3: | Email4: '],
|
| 94 |
+
['Name: Priya Prasad Reddy | First: Priya | Middle: Prasad | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APT, 24 MAIN RD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APT, 24 MAIN RD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9149203558 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: ', 'Name: Priya Reddy | First: Priya | Middle: | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9208449460 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: '],
|
| 95 |
+
['Name: Aditya Singh | First: Aditya | Middle: | Last: Singh | Gender: M | DOB: 1982-02-07 | Spouse: | Mother: | Father: | Company: CAPGEMINI INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 753 SECTOR 12 NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641099 | Address1: 753 SECTOR 12 NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641099 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: adityasingh@company.in | Email1: | Email2: | Email3: | Email4: ', 'Name: Kiran Chopra | First: Kiran | Middle: | Last: Chopra | Gender: M | DOB: 1986-04-13 | Spouse: | Mother: | Father: | Company: GOOGLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 524 MG ROAD NEAR SCHOOL | City: NAGPUR | State: MAHARASHTRA | ZIP: 440013 | Address1: 524 MG ROAD NEAR SCHOOL | City1: NAGPUR | State1: MAHARASHTRA | ZIP1: 440013 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: kiranchopra@company.in | Email1: | Email2: | Email3: | Email4: '],
|
| 96 |
+
['Name: Divya Patel | First: Divya | Middle: | Last: Patel | Gender: F | DOB: 1985-02-01 | Spouse: | Mother: | Father: | Company: INFOSYS TECHNOLOGIES | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 316 SECTOR 12 NEAR PARK | City: HYDERABAD | State: TELANGANA | ZIP: 500048 | Address1: 316 SECTOR 12 NEAR PARK | City1: HYDERABAD | State1: TELANGANA | ZIP1: 500048 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8883181644 | Phone1: | Phone2: | Phone3: | Phone4: | Email: divyapatel@outlook.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Nitin Rao | First: Nitin | Middle: | Last: Rao | Gender: M | DOB: 1987-12-02 | Spouse: | Mother: | Father: | Company: RELIANCE JIO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 612 BANJARA HILLS NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641039 | Address1: 612 BANJARA HILLS NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641039 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9358550137 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nitinrao@gmail.com | Email1: | Email2: | Email3: | Email4: '],
|
| 97 |
+
['Name: Neha Gupta | First: Neha | Middle: | Last: Gupta | Gender: F | DOB: 1981-04-13 | Spouse: | Mother: | Father: | Company: HEALTHCARE SYSTEMS | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: HOUSE 57, BLOCK A kanpur | City: KANPUR | State: UTTAR PRADESH | ZIP: 208001 | Address1: HOUSE 57, BLOCK A kanpur | City1: KANPUR | State1: UTTAR PRADESH | ZIP1: 208001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9423646570 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nehagupta@gmail.com | Email1: | Email2: | Email3: | Email4: ', 'Name: Vikram Gupta | First: Vikram | Middle: | Last: Gupta | Gender: M | DOB: 1993-06-04 | Spouse: | Mother: | Father: | Company: TECH SOLUTIONS PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: PLOT 510, NEAR TEMPLE delhi | City: DELHI | State: DELHI | ZIP: 110001 | Address1: PLOT 510, NEAR TEMPLE delhi | City1: DELHI | State1: DELHI | ZIP1: 110001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9288650860 | Phone1: | Phone2: | Phone3: | Phone4: | Email: vikramgupta@yahoo.com | Email1: | Email2: | Email3: | Email4: '],
|
| 98 |
+
]
|
| 99 |
+
scores = model.predict(pairs)
|
| 100 |
+
print(scores.shape)
|
| 101 |
+
# (5,)
|
| 102 |
+
|
| 103 |
+
# Or rank different texts based on similarity to a single text
|
| 104 |
+
ranks = model.rank(
|
| 105 |
+
'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 8641615633 | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@gmail.com | Email1: | Email2: | Email3: | Email4: ',
|
| 106 |
+
[
|
| 107 |
+
'Name: Rohan Rao Sharma | First: Rohan | Middle: Rao | Last: Sharma | Gender: M | DOB: 1983-08-04 | Spouse: | Mother: | Father: | Company: ORACLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 551 KORAMANGALA NEAR MALL | City: AHMEDABAD | State: GUJARAT | ZIP: 380071 | Address1: 551 KORAMANGALA NEAR MALL | City1: AHMEDABAD | State1: GUJARAT | ZIP1: 380071 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: rohan.sharma@yahoo.com | Email1: | Email2: | Email3: | Email4: ',
|
| 108 |
+
'Name: Priya Reddy | First: Priya | Middle: | Last: Reddy | Gender: F | DOB: 1983-07-03 | Spouse: | Mother: | Father: | Company: GLOBAL INFOTECH PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City: LUCKNOW | State: UTTAR PRADESH | ZIP: 226001 | Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD | City1: LUCKNOW | State1: UTTAR PRADESH | ZIP1: 226001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9208449460 | Phone1: | Phone2: | Phone3: | Phone4: | Email: priyareddy@gmail.com | Email1: priya.reddy@globalinfotech.com | Email2: | Email3: | Email4: ',
|
| 109 |
+
'Name: Kiran Chopra | First: Kiran | Middle: | Last: Chopra | Gender: M | DOB: 1986-04-13 | Spouse: | Mother: | Father: | Company: GOOGLE INDIA | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 524 MG ROAD NEAR SCHOOL | City: NAGPUR | State: MAHARASHTRA | ZIP: 440013 | Address1: 524 MG ROAD NEAR SCHOOL | City1: NAGPUR | State1: MAHARASHTRA | ZIP1: 440013 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: | Phone1: | Phone2: | Phone3: | Phone4: | Email: kiranchopra@company.in | Email1: | Email2: | Email3: | Email4: ',
|
| 110 |
+
'Name: Nitin Rao | First: Nitin | Middle: | Last: Rao | Gender: M | DOB: 1987-12-02 | Spouse: | Mother: | Father: | Company: RELIANCE JIO | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: 612 BANJARA HILLS NEAR PARK | City: COIMBATORE | State: TAMIL NADU | ZIP: 641039 | Address1: 612 BANJARA HILLS NEAR PARK | City1: COIMBATORE | State1: TAMIL NADU | ZIP1: 641039 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9358550137 | Phone1: | Phone2: | Phone3: | Phone4: | Email: nitinrao@gmail.com | Email1: | Email2: | Email3: | Email4: ',
|
| 111 |
+
'Name: Vikram Gupta | First: Vikram | Middle: | Last: Gupta | Gender: M | DOB: 1993-06-04 | Spouse: | Mother: | Father: | Company: TECH SOLUTIONS PVT LTD | ParentCompany: | TaxID: | LicenseID: | PassportID: | Address: PLOT 510, NEAR TEMPLE delhi | City: DELHI | State: DELHI | ZIP: 110001 | Address1: PLOT 510, NEAR TEMPLE delhi | City1: DELHI | State1: DELHI | ZIP1: 110001 | Address2: | City2: | State2: | ZIP2: | Address3: | City3: | State3: | ZIP3: | Address4: | City4: | State4: | ZIP4: | Phone: 9288650860 | Phone1: | Phone2: | Phone3: | Phone4: | Email: vikramgupta@yahoo.com | Email1: | Email2: | Email3: | Email4: ',
|
| 112 |
+
]
|
| 113 |
+
)
|
| 114 |
+
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]
|
| 115 |
+
```
|
| 116 |
+
|
| 117 |
+
<!--
|
| 118 |
+
### Direct Usage (Transformers)
|
| 119 |
+
|
| 120 |
+
<details><summary>Click to see the direct usage in Transformers</summary>
|
| 121 |
+
|
| 122 |
+
</details>
|
| 123 |
+
-->
|
| 124 |
+
|
| 125 |
+
<!--
|
| 126 |
+
### Downstream Usage (Sentence Transformers)
|
| 127 |
+
|
| 128 |
+
You can finetune this model on your own dataset.
|
| 129 |
+
|
| 130 |
+
<details><summary>Click to expand</summary>
|
| 131 |
+
|
| 132 |
+
</details>
|
| 133 |
+
-->
|
| 134 |
+
|
| 135 |
+
<!--
|
| 136 |
+
### Out-of-Scope Use
|
| 137 |
+
|
| 138 |
+
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 139 |
+
-->
|
| 140 |
+
|
| 141 |
+
## Evaluation
|
| 142 |
+
|
| 143 |
+
### Metrics
|
| 144 |
+
|
| 145 |
+
#### Cross Encoder Classification
|
| 146 |
+
|
| 147 |
+
* Dataset: `rich-entity-matching-lora`
|
| 148 |
+
* Evaluated with [<code>CrossEncoderClassificationEvaluator</code>](https://sbert.net/docs/package_reference/cross_encoder/evaluation.html#sentence_transformers.cross_encoder.evaluation.CrossEncoderClassificationEvaluator)
|
| 149 |
+
|
| 150 |
+
| Metric | Value |
|
| 151 |
+
|:----------------------|:--------|
|
| 152 |
+
| accuracy | 1.0 |
|
| 153 |
+
| accuracy_threshold | 0.4994 |
|
| 154 |
+
| f1 | 1.0 |
|
| 155 |
+
| f1_threshold | 0.4994 |
|
| 156 |
+
| precision | 1.0 |
|
| 157 |
+
| recall | 1.0 |
|
| 158 |
+
| **average_precision** | **1.0** |
|
| 159 |
+
|
| 160 |
+
<!--
|
| 161 |
+
## Bias, Risks and Limitations
|
| 162 |
+
|
| 163 |
+
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
|
| 164 |
+
-->
|
| 165 |
+
|
| 166 |
+
<!--
|
| 167 |
+
### Recommendations
|
| 168 |
+
|
| 169 |
+
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 170 |
+
-->
|
| 171 |
+
|
| 172 |
+
## Training Details
|
| 173 |
+
|
| 174 |
+
### Training Dataset
|
| 175 |
+
|
| 176 |
+
#### Unnamed Dataset
|
| 177 |
+
|
| 178 |
+
* Size: 126 training samples
|
| 179 |
+
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
|
| 180 |
+
* Approximate statistics based on the first 126 samples:
|
| 181 |
+
| | sentence1 | sentence2 | label |
|
| 182 |
+
|:--------|:--------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------|:------------------------------------------------|
|
| 183 |
+
| type | string | string | int |
|
| 184 |
+
| details | <ul><li>min: 622 characters</li><li>mean: 660.92 characters</li><li>max: 715 characters</li></ul> | <ul><li>min: 627 characters</li><li>mean: 664.96 characters</li><li>max: 731 characters</li></ul> | <ul><li>0: ~53.97%</li><li>1: ~46.03%</li></ul> |
|
| 185 |
+
* Samples:
|
| 186 |
+
| sentence1 | sentence2 | label |
|
| 187 |
+
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
|
| 188 |
+
| <code>Name: Arjun Singh \| First: Arjun \| Middle: \| Last: Singh \| Gender: M \| DOB: 1997-12-22 \| Spouse: \| Mother: \| Father: \| Company: RELIANCE JIO \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 910 KORAMANGALA NEAR MALL \| City: PUNE \| State: MAHARASHTRA \| ZIP: 411092 \| Address1: 910 KORAMANGALA NEAR MALL \| City1: PUNE \| State1: MAHARASHTRA \| ZIP1: 411092 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: arjunsingh@workmail.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>Name: Swati Nair \| First: Swati \| Middle: \| Last: Nair \| Gender: F \| DOB: 2003-05-24 \| Spouse: \| Mother: \| Father: \| Company: ZOMATO \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 108 MAIN BAZAR NEAR SCHOOL \| City: CHENNAI \| State: TAMIL NADU \| ZIP: 600078 \| Address1: 108 MAIN BAZAR NEAR SCHOOL \| City1: CHENNAI \| State1: TAMIL NADU \| ZIP1: 600078 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 7185520040 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: swatinair@yahoo.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>0</code> |
|
| 189 |
+
| <code>Name: Rahul Iyer \| First: Rahul \| Middle: \| Last: Iyer \| Gender: M \| DOB: 1980-02-25 \| Spouse: \| Mother: \| Father: \| Company: MANUFACTURING CO \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: FLAT 271, PARK APT, 32 MAIN RD \| City: BANGALORE \| State: KARNATAKA \| ZIP: 560001 \| Address1: FLAT 271, PARK APT, 32 MAIN RD \| City1: BANGALORE \| State1: KARNATAKA \| ZIP1: 560001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9162959284 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: rahuliyer@gmail.com \| Email1: rahul.iyer@manufacturingco.com \| Email2: \| Email3: \| Email4: </code> | <code>Name: Rahul Iyer \| First: Rahul \| Middle: \| Last: Iyer \| Gender: M \| DOB: 1980-02-25 \| Spouse: \| Mother: \| Father: \| Company: MANUFACTURING CO PVT LTD \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: FLAT 271, PARK APT, 32 MAIN RD \| City: BANGALORE \| State: KARNATAKA \| ZIP: 560001 \| Address1: FLAT 271, PARK APT, 32 MAIN RD \| City1: BANGALORE \| State1: KARNATAKA \| ZIP1: 560001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9162959284 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: rahuliyer@gmail.com \| Email1: rahul.iyer@manufacturingco.com \| Email2: \| Email3: \| Email4: </code> | <code>1</code> |
|
| 190 |
+
| <code>Name: Amit Reddy \| First: Amit \| Middle: \| Last: Reddy \| Gender: M \| DOB: 1994-06-17 \| Spouse: \| Mother: \| Father: \| Company: AUTO PARTS PVT \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: HOUSE 112, SECTOR 12 kanpur \| City: KANPUR \| State: UTTAR PRADESH \| ZIP: 208001 \| Address1: HOUSE 112, SECTOR 12 kanpur \| City1: KANPUR \| State1: UTTAR PRADESH \| ZIP1: 208001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9337887233 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: amitreddy@gmail.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>Name: Pradeep Gupta \| First: Pradeep \| Middle: \| Last: Gupta \| Gender: M \| DOB: 2004-11-01 \| Spouse: \| Mother: \| Father: \| Company: MANUFACTURING CO \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: PLOT 491, NEAR TEMPLE delhi \| City: DELHI \| State: DELHI \| ZIP: 110001 \| Address1: PLOT 491, NEAR TEMPLE delhi \| City1: DELHI \| State1: DELHI \| ZIP1: 110001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9836691577 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: pradeepgupta@yahoo.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>0</code> |
|
| 191 |
+
* Loss: [<code>BinaryCrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#binarycrossentropyloss) with these parameters:
|
| 192 |
+
```json
|
| 193 |
+
{
|
| 194 |
+
"activation_fn": "torch.nn.modules.linear.Identity",
|
| 195 |
+
"pos_weight": null
|
| 196 |
+
}
|
| 197 |
+
```
|
| 198 |
+
|
| 199 |
+
### Evaluation Dataset
|
| 200 |
+
|
| 201 |
+
#### Unnamed Dataset
|
| 202 |
+
|
| 203 |
+
* Size: 32 evaluation samples
|
| 204 |
+
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
|
| 205 |
+
* Approximate statistics based on the first 32 samples:
|
| 206 |
+
| | sentence1 | sentence2 | label |
|
| 207 |
+
|:--------|:--------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------|:------------------------------------------------|
|
| 208 |
+
| type | string | string | int |
|
| 209 |
+
| details | <ul><li>min: 620 characters</li><li>mean: 661.31 characters</li><li>max: 708 characters</li></ul> | <ul><li>min: 628 characters</li><li>mean: 660.66 characters</li><li>max: 720 characters</li></ul> | <ul><li>0: ~53.12%</li><li>1: ~46.88%</li></ul> |
|
| 210 |
+
* Samples:
|
| 211 |
+
| sentence1 | sentence2 | label |
|
| 212 |
+
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
|
| 213 |
+
| <code>Name: Rohan Rao Sharma \| First: Rohan \| Middle: Rao \| Last: Sharma \| Gender: M \| DOB: 1983-08-04 \| Spouse: \| Mother: \| Father: \| Company: ORACLE INDIA \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 551 KORAMANGALA NEAR MALL \| City: AHMEDABAD \| State: GUJARAT \| ZIP: 380071 \| Address1: 551 KORAMANGALA NEAR MALL \| City1: AHMEDABAD \| State1: GUJARAT \| ZIP1: 380071 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 8641615633 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: rohan.sharma@gmail.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>Name: Rohan Rao Sharma \| First: Rohan \| Middle: Rao \| Last: Sharma \| Gender: M \| DOB: 1983-08-04 \| Spouse: \| Mother: \| Father: \| Company: ORACLE INDIA \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 551 KORAMANGALA NEAR MALL \| City: AHMEDABAD \| State: GUJARAT \| ZIP: 380071 \| Address1: 551 KORAMANGALA NEAR MALL \| City1: AHMEDABAD \| State1: GUJARAT \| ZIP1: 380071 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: rohan.sharma@yahoo.com \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>1</code> |
|
| 214 |
+
| <code>Name: Priya Prasad Reddy \| First: Priya \| Middle: Prasad \| Last: Reddy \| Gender: F \| DOB: 1983-07-03 \| Spouse: \| Mother: \| Father: \| Company: GLOBAL INFOTECH \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: FLAT 401, LAKE APT, 24 MAIN RD \| City: LUCKNOW \| State: UTTAR PRADESH \| ZIP: 226001 \| Address1: FLAT 401, LAKE APT, 24 MAIN RD \| City1: LUCKNOW \| State1: UTTAR PRADESH \| ZIP1: 226001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9149203558 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: priyareddy@gmail.com \| Email1: priya.reddy@globalinfotech.com \| Email2: \| Email3: \| Email4: </code> | <code>Name: Priya Reddy \| First: Priya \| Middle: \| Last: Reddy \| Gender: F \| DOB: 1983-07-03 \| Spouse: \| Mother: \| Father: \| Company: GLOBAL INFOTECH PVT LTD \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD \| City: LUCKNOW \| State: UTTAR PRADESH \| ZIP: 226001 \| Address1: FLAT 401, LAKE APARTMENT, 24 MAIN ROAD \| City1: LUCKNOW \| State1: UTTAR PRADESH \| ZIP1: 226001 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: 9208449460 \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: priyareddy@gmail.com \| Email1: priya.reddy@globalinfotech.com \| Email2: \| Email3: \| Email4: </code> | <code>1</code> |
|
| 215 |
+
| <code>Name: Aditya Singh \| First: Aditya \| Middle: \| Last: Singh \| Gender: M \| DOB: 1982-02-07 \| Spouse: \| Mother: \| Father: \| Company: CAPGEMINI INDIA \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 753 SECTOR 12 NEAR PARK \| City: COIMBATORE \| State: TAMIL NADU \| ZIP: 641099 \| Address1: 753 SECTOR 12 NEAR PARK \| City1: COIMBATORE \| State1: TAMIL NADU \| ZIP1: 641099 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: adityasingh@company.in \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>Name: Kiran Chopra \| First: Kiran \| Middle: \| Last: Chopra \| Gender: M \| DOB: 1986-04-13 \| Spouse: \| Mother: \| Father: \| Company: GOOGLE INDIA \| ParentCompany: \| TaxID: \| LicenseID: \| PassportID: \| Address: 524 MG ROAD NEAR SCHOOL \| City: NAGPUR \| State: MAHARASHTRA \| ZIP: 440013 \| Address1: 524 MG ROAD NEAR SCHOOL \| City1: NAGPUR \| State1: MAHARASHTRA \| ZIP1: 440013 \| Address2: \| City2: \| State2: \| ZIP2: \| Address3: \| City3: \| State3: \| ZIP3: \| Address4: \| City4: \| State4: \| ZIP4: \| Phone: \| Phone1: \| Phone2: \| Phone3: \| Phone4: \| Email: kiranchopra@company.in \| Email1: \| Email2: \| Email3: \| Email4: </code> | <code>0</code> |
|
| 216 |
+
* Loss: [<code>BinaryCrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#binarycrossentropyloss) with these parameters:
|
| 217 |
+
```json
|
| 218 |
+
{
|
| 219 |
+
"activation_fn": "torch.nn.modules.linear.Identity",
|
| 220 |
+
"pos_weight": null
|
| 221 |
+
}
|
| 222 |
+
```
|
| 223 |
+
|
| 224 |
+
### Training Hyperparameters
|
| 225 |
+
#### Non-Default Hyperparameters
|
| 226 |
+
|
| 227 |
+
- `eval_strategy`: steps
|
| 228 |
+
- `per_device_train_batch_size`: 4
|
| 229 |
+
- `gradient_accumulation_steps`: 4
|
| 230 |
+
- `learning_rate`: 3e-05
|
| 231 |
+
- `weight_decay`: 0.05
|
| 232 |
+
- `warmup_ratio`: 0.2
|
| 233 |
+
- `warmup_steps`: 0.2
|
| 234 |
+
- `dataloader_num_workers`: 3
|
| 235 |
+
- `load_best_model_at_end`: True
|
| 236 |
+
- `dataloader_pin_memory`: False
|
| 237 |
+
- `dataloader_persistent_workers`: True
|
| 238 |
+
|
| 239 |
+
#### All Hyperparameters
|
| 240 |
+
<details><summary>Click to expand</summary>
|
| 241 |
+
|
| 242 |
+
- `do_predict`: False
|
| 243 |
+
- `eval_strategy`: steps
|
| 244 |
+
- `prediction_loss_only`: True
|
| 245 |
+
- `per_device_train_batch_size`: 4
|
| 246 |
+
- `per_device_eval_batch_size`: 8
|
| 247 |
+
- `gradient_accumulation_steps`: 4
|
| 248 |
+
- `eval_accumulation_steps`: None
|
| 249 |
+
- `torch_empty_cache_steps`: None
|
| 250 |
+
- `learning_rate`: 3e-05
|
| 251 |
+
- `weight_decay`: 0.05
|
| 252 |
+
- `adam_beta1`: 0.9
|
| 253 |
+
- `adam_beta2`: 0.999
|
| 254 |
+
- `adam_epsilon`: 1e-08
|
| 255 |
+
- `max_grad_norm`: 1.0
|
| 256 |
+
- `num_train_epochs`: 3
|
| 257 |
+
- `max_steps`: -1
|
| 258 |
+
- `lr_scheduler_type`: linear
|
| 259 |
+
- `lr_scheduler_kwargs`: None
|
| 260 |
+
- `warmup_ratio`: 0.2
|
| 261 |
+
- `warmup_steps`: 0.2
|
| 262 |
+
- `log_level`: passive
|
| 263 |
+
- `log_level_replica`: warning
|
| 264 |
+
- `log_on_each_node`: True
|
| 265 |
+
- `logging_nan_inf_filter`: True
|
| 266 |
+
- `enable_jit_checkpoint`: False
|
| 267 |
+
- `save_on_each_node`: False
|
| 268 |
+
- `save_only_model`: False
|
| 269 |
+
- `restore_callback_states_from_checkpoint`: False
|
| 270 |
+
- `use_cpu`: False
|
| 271 |
+
- `seed`: 42
|
| 272 |
+
- `data_seed`: None
|
| 273 |
+
- `bf16`: False
|
| 274 |
+
- `fp16`: False
|
| 275 |
+
- `bf16_full_eval`: False
|
| 276 |
+
- `fp16_full_eval`: False
|
| 277 |
+
- `tf32`: None
|
| 278 |
+
- `local_rank`: -1
|
| 279 |
+
- `ddp_backend`: None
|
| 280 |
+
- `debug`: []
|
| 281 |
+
- `dataloader_drop_last`: False
|
| 282 |
+
- `dataloader_num_workers`: 3
|
| 283 |
+
- `dataloader_prefetch_factor`: None
|
| 284 |
+
- `disable_tqdm`: False
|
| 285 |
+
- `remove_unused_columns`: True
|
| 286 |
+
- `label_names`: None
|
| 287 |
+
- `load_best_model_at_end`: True
|
| 288 |
+
- `ignore_data_skip`: False
|
| 289 |
+
- `fsdp`: []
|
| 290 |
+
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
|
| 291 |
+
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
|
| 292 |
+
- `parallelism_config`: None
|
| 293 |
+
- `deepspeed`: None
|
| 294 |
+
- `label_smoothing_factor`: 0.0
|
| 295 |
+
- `optim`: adamw_torch_fused
|
| 296 |
+
- `optim_args`: None
|
| 297 |
+
- `group_by_length`: False
|
| 298 |
+
- `length_column_name`: length
|
| 299 |
+
- `project`: huggingface
|
| 300 |
+
- `trackio_space_id`: trackio
|
| 301 |
+
- `ddp_find_unused_parameters`: None
|
| 302 |
+
- `ddp_bucket_cap_mb`: None
|
| 303 |
+
- `ddp_broadcast_buffers`: False
|
| 304 |
+
- `dataloader_pin_memory`: False
|
| 305 |
+
- `dataloader_persistent_workers`: True
|
| 306 |
+
- `skip_memory_metrics`: True
|
| 307 |
+
- `push_to_hub`: False
|
| 308 |
+
- `resume_from_checkpoint`: None
|
| 309 |
+
- `hub_model_id`: None
|
| 310 |
+
- `hub_strategy`: every_save
|
| 311 |
+
- `hub_private_repo`: None
|
| 312 |
+
- `hub_always_push`: False
|
| 313 |
+
- `hub_revision`: None
|
| 314 |
+
- `gradient_checkpointing`: False
|
| 315 |
+
- `gradient_checkpointing_kwargs`: None
|
| 316 |
+
- `include_for_metrics`: []
|
| 317 |
+
- `eval_do_concat_batches`: True
|
| 318 |
+
- `auto_find_batch_size`: False
|
| 319 |
+
- `full_determinism`: False
|
| 320 |
+
- `ddp_timeout`: 1800
|
| 321 |
+
- `torch_compile`: False
|
| 322 |
+
- `torch_compile_backend`: None
|
| 323 |
+
- `torch_compile_mode`: None
|
| 324 |
+
- `include_num_input_tokens_seen`: no
|
| 325 |
+
- `neftune_noise_alpha`: None
|
| 326 |
+
- `optim_target_modules`: None
|
| 327 |
+
- `batch_eval_metrics`: False
|
| 328 |
+
- `eval_on_start`: False
|
| 329 |
+
- `use_liger_kernel`: False
|
| 330 |
+
- `liger_kernel_config`: None
|
| 331 |
+
- `eval_use_gather_object`: False
|
| 332 |
+
- `average_tokens_across_devices`: True
|
| 333 |
+
- `use_cache`: False
|
| 334 |
+
- `prompts`: None
|
| 335 |
+
- `batch_sampler`: batch_sampler
|
| 336 |
+
- `multi_dataset_batch_sampler`: proportional
|
| 337 |
+
- `router_mapping`: {}
|
| 338 |
+
- `learning_rate_mapping`: {}
|
| 339 |
+
|
| 340 |
+
</details>
|
| 341 |
+
|
| 342 |
+
### Training Logs
|
| 343 |
+
| Epoch | Step | Training Loss | Validation Loss | rich-entity-matching-lora_average_precision |
|
| 344 |
+
|:-----:|:----:|:-------------:|:---------------:|:-------------------------------------------:|
|
| 345 |
+
| 0.5 | 4 | 0.0774 | - | - |
|
| 346 |
+
| 1.0 | 8 | 0.0427 | - | - |
|
| 347 |
+
| 1.5 | 12 | 0.0678 | - | - |
|
| 348 |
+
| 1.875 | 15 | - | 0.0067 | 1.0 |
|
| 349 |
+
| 2.0 | 16 | 0.0284 | - | - |
|
| 350 |
+
| 2.5 | 20 | 0.0091 | - | - |
|
| 351 |
+
| 3.0 | 24 | 0.0860 | - | - |
|
| 352 |
+
|
| 353 |
+
|
| 354 |
+
### Framework Versions
|
| 355 |
+
- Python: 3.12.12
|
| 356 |
+
- Sentence Transformers: 5.2.3
|
| 357 |
+
- Transformers: 5.0.0
|
| 358 |
+
- PyTorch: 2.10.0+cpu
|
| 359 |
+
- Accelerate: 1.12.0
|
| 360 |
+
- Datasets: 4.8.3
|
| 361 |
+
- Tokenizers: 0.22.2
|
| 362 |
+
|
| 363 |
+
## Citation
|
| 364 |
+
|
| 365 |
+
### BibTeX
|
| 366 |
+
|
| 367 |
+
#### Sentence Transformers
|
| 368 |
+
```bibtex
|
| 369 |
+
@inproceedings{reimers-2019-sentence-bert,
|
| 370 |
+
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 371 |
+
author = "Reimers, Nils and Gurevych, Iryna",
|
| 372 |
+
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 373 |
+
month = "11",
|
| 374 |
+
year = "2019",
|
| 375 |
+
publisher = "Association for Computational Linguistics",
|
| 376 |
+
url = "https://arxiv.org/abs/1908.10084",
|
| 377 |
+
}
|
| 378 |
+
```
|
| 379 |
+
|
| 380 |
+
<!--
|
| 381 |
+
## Glossary
|
| 382 |
+
|
| 383 |
+
*Clearly define terms in order to be accessible across audiences.*
|
| 384 |
+
-->
|
| 385 |
+
|
| 386 |
+
<!--
|
| 387 |
+
## Model Card Authors
|
| 388 |
+
|
| 389 |
+
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
|
| 390 |
+
-->
|
| 391 |
+
|
| 392 |
+
<!--
|
| 393 |
+
## Model Card Contact
|
| 394 |
+
|
| 395 |
+
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
| 396 |
+
-->
|
config.json
ADDED
|
@@ -0,0 +1,40 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_cross_attention": false,
|
| 3 |
+
"architectures": [
|
| 4 |
+
"XLMRobertaForSequenceClassification"
|
| 5 |
+
],
|
| 6 |
+
"attention_probs_dropout_prob": 0.1,
|
| 7 |
+
"bos_token_id": 0,
|
| 8 |
+
"classifier_dropout": null,
|
| 9 |
+
"dtype": "float32",
|
| 10 |
+
"eos_token_id": 2,
|
| 11 |
+
"hidden_act": "gelu",
|
| 12 |
+
"hidden_dropout_prob": 0.1,
|
| 13 |
+
"hidden_size": 1024,
|
| 14 |
+
"id2label": {
|
| 15 |
+
"0": "LABEL_0"
|
| 16 |
+
},
|
| 17 |
+
"initializer_range": 0.02,
|
| 18 |
+
"intermediate_size": 4096,
|
| 19 |
+
"is_decoder": false,
|
| 20 |
+
"label2id": {
|
| 21 |
+
"LABEL_0": 0
|
| 22 |
+
},
|
| 23 |
+
"layer_norm_eps": 1e-05,
|
| 24 |
+
"max_position_embeddings": 8194,
|
| 25 |
+
"model_type": "xlm-roberta",
|
| 26 |
+
"num_attention_heads": 16,
|
| 27 |
+
"num_hidden_layers": 24,
|
| 28 |
+
"output_past": true,
|
| 29 |
+
"pad_token_id": 1,
|
| 30 |
+
"position_embedding_type": "absolute",
|
| 31 |
+
"sentence_transformers": {
|
| 32 |
+
"activation_fn": "torch.nn.modules.activation.Sigmoid",
|
| 33 |
+
"version": "5.2.3"
|
| 34 |
+
},
|
| 35 |
+
"tie_word_embeddings": true,
|
| 36 |
+
"transformers_version": "5.0.0",
|
| 37 |
+
"type_vocab_size": 1,
|
| 38 |
+
"use_cache": false,
|
| 39 |
+
"vocab_size": 250002
|
| 40 |
+
}
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:9791952a8573e3dc8acd68a712ebd4afae16deab23857738d30998ad028538d8
|
| 3 |
+
size 2271071804
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:127360f007d2212ba954a5d2ea45ed1337dce45988ea30e577d0644884c46d68
|
| 3 |
+
size 16781738
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,16 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": true,
|
| 3 |
+
"backend": "tokenizers",
|
| 4 |
+
"bos_token": "<s>",
|
| 5 |
+
"clean_up_tokenization_spaces": true,
|
| 6 |
+
"cls_token": "<s>",
|
| 7 |
+
"eos_token": "</s>",
|
| 8 |
+
"is_local": false,
|
| 9 |
+
"mask_token": "<mask>",
|
| 10 |
+
"model_max_length": 512,
|
| 11 |
+
"pad_token": "<pad>",
|
| 12 |
+
"sep_token": "</s>",
|
| 13 |
+
"sp_model_kwargs": {},
|
| 14 |
+
"tokenizer_class": "XLMRobertaTokenizer",
|
| 15 |
+
"unk_token": "<unk>"
|
| 16 |
+
}
|