Instructions to use prithivida/bert-for-patents-64d with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use prithivida/bert-for-patents-64d with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("feature-extraction", model="prithivida/bert-for-patents-64d")# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("prithivida/bert-for-patents-64d") model = AutoModel.from_pretrained("prithivida/bert-for-patents-64d") - Notebooks
- Google Colab
- Kaggle
Fix yaml
Browse files
README.md
CHANGED
|
@@ -20,7 +20,7 @@ metrics:
|
|
| 20 |
# Motivation
|
| 21 |
|
| 22 |
|
| 23 |
-
This model is based on anferico/bert-for-patents - a BERT<sub>LARGE</sub> model (See details below). By default, the pre-trained model's output embeddings with size 768 (base-models) or with size 1024 (large-models). However, when you store Millions of embeddings, this can require quite a lot of memory/storage. So have reduced the embedding dimension to 64 i.e 1/16th of 1024 using Principle Component Analysis (PCA) and it still gives a comparable performance. Yes! PCA gives better performance than NMF. Note: This process neither improves the runtime, nor the memory requirement for running the model. It only reduces the needed space to store embeddings, for example, for semantic search using vector databases.
|
| 24 |
|
| 25 |
|
| 26 |
|
|
|
|
| 20 |
# Motivation
|
| 21 |
|
| 22 |
|
| 23 |
+
This model is based on anferico/bert-for-patents - a BERT<sub>LARGE</sub> model (See next section for details below). By default, the pre-trained model's output embeddings with size 768 (base-models) or with size 1024 (large-models). However, when you store Millions of embeddings, this can require quite a lot of memory/storage. So have reduced the embedding dimension to 64 i.e 1/16th of 1024 using Principle Component Analysis (PCA) and it still gives a comparable performance. Yes! PCA gives better performance than NMF. Note: This process neither improves the runtime, nor the memory requirement for running the model. It only reduces the needed space to store embeddings, for example, for semantic search using vector databases.
|
| 24 |
|
| 25 |
|
| 26 |
|