Sentence Similarity
Safetensors
sentence-transformers
Amharic
PyLate
xlm-roberta
ColBERT
feature-extraction
Generated from Trainer
dataset_size:76474
loss:Contrastive
Eval Results (legacy)
text-embeddings-inference
Instructions to use rasyosef/colbert-roberta-amharic-medium with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use rasyosef/colbert-roberta-amharic-medium with sentence-transformers:
from pylate import models queries = [ "Which planet is known as the Red Planet?", "What is the largest planet in our solar system?", ] documents = [ ["Mars is the Red Planet.", "Venus is Earth's twin."], ["Jupiter is the largest planet.", "Saturn has rings."], ] model = models.ColBERT(model_name_or_path="rasyosef/colbert-roberta-amharic-medium") queries_emb = model.encode(queries, is_query=True) docs_emb = model.encode(documents, is_query=False) - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -253,6 +253,8 @@ You can finetune this model on your own dataset.
|
|
| 253 |
|
| 254 |
## Training Details
|
| 255 |
|
|
|
|
|
|
|
| 256 |
### Training Dataset
|
| 257 |
|
| 258 |
#### Unnamed Dataset
|
|
@@ -456,6 +458,7 @@ You can finetune this model on your own dataset.
|
|
| 456 |
| 3.7657 | 9000 | - | 0.1922 | - |
|
| 457 |
| 0 | 0 | - | - | 0.9754 |
|
| 458 |
|
|
|
|
| 459 |
|
| 460 |
### Framework Versions
|
| 461 |
- Python: 3.11.12
|
|
@@ -470,28 +473,12 @@ You can finetune this model on your own dataset.
|
|
| 470 |
|
| 471 |
## Citation
|
| 472 |
|
| 473 |
-
### BibTeX
|
| 474 |
-
|
| 475 |
-
#### Sentence Transformers
|
| 476 |
-
```bibtex
|
| 477 |
-
@inproceedings{reimers-2019-sentence-bert,
|
| 478 |
-
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 479 |
-
author = "Reimers, Nils and Gurevych, Iryna",
|
| 480 |
-
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 481 |
-
month = "11",
|
| 482 |
-
year = "2019",
|
| 483 |
-
publisher = "Association for Computational Linguistics",
|
| 484 |
-
url = "https://arxiv.org/abs/1908.10084"
|
| 485 |
-
}
|
| 486 |
```
|
| 487 |
-
|
| 488 |
-
|
| 489 |
-
|
| 490 |
-
|
| 491 |
-
|
| 492 |
-
author={Chaffin, Antoine and Sourty, Raphaël},
|
| 493 |
-
url={https://github.com/lightonai/pylate},
|
| 494 |
-
year={2024}
|
| 495 |
}
|
| 496 |
```
|
| 497 |
|
|
|
|
| 253 |
|
| 254 |
## Training Details
|
| 255 |
|
| 256 |
+
<details><summary>Click to expand</summary>
|
| 257 |
+
|
| 258 |
### Training Dataset
|
| 259 |
|
| 260 |
#### Unnamed Dataset
|
|
|
|
| 458 |
| 3.7657 | 9000 | - | 0.1922 | - |
|
| 459 |
| 0 | 0 | - | - | 0.9754 |
|
| 460 |
|
| 461 |
+
</details>
|
| 462 |
|
| 463 |
### Framework Versions
|
| 464 |
- Python: 3.11.12
|
|
|
|
| 473 |
|
| 474 |
## Citation
|
| 475 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 476 |
```
|
| 477 |
+
@inproceedings{mekonnen2025amharic,
|
| 478 |
+
title={Optimized Text Embedding Models and Benchmarks for Amharic Passage Retrieval},
|
| 479 |
+
author={Kidist Amde Mekonnen, Yosef Worku Alemneh, Maarten de Rijke },
|
| 480 |
+
booktitle={Findings of ACL},
|
| 481 |
+
year={2025}
|
|
|
|
|
|
|
|
|
|
| 482 |
}
|
| 483 |
```
|
| 484 |
|