Instructions to use RLHFlow/RewardModel-Mistral-7B-for-DPA-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use RLHFlow/RewardModel-Mistral-7B-for-DPA-v1 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="RLHFlow/RewardModel-Mistral-7B-for-DPA-v1", trust_remote_code=True)# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("RLHFlow/RewardModel-Mistral-7B-for-DPA-v1", trust_remote_code=True) model = AutoModelForSequenceClassification.from_pretrained("RLHFlow/RewardModel-Mistral-7B-for-DPA-v1", trust_remote_code=True) - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -42,7 +42,7 @@ Here is a sample code that you can try
|
|
| 42 |
from transformers import AutoModelForSequenceClassification,AutoTokenizer
|
| 43 |
import torch
|
| 44 |
device = 'cuda'
|
| 45 |
-
path = "
|
| 46 |
rm = AutoModelForSequenceClassification.from_pretrained(path, trust_remote_code=True).to(device)
|
| 47 |
tokenizer = AutoTokenizer.from_pretrained(path)
|
| 48 |
|
|
|
|
| 42 |
from transformers import AutoModelForSequenceClassification,AutoTokenizer
|
| 43 |
import torch
|
| 44 |
device = 'cuda'
|
| 45 |
+
path = "RLHFlow/RewardModel-Mistral-7B-for-DPA-v1"
|
| 46 |
rm = AutoModelForSequenceClassification.from_pretrained(path, trust_remote_code=True).to(device)
|
| 47 |
tokenizer = AutoTokenizer.from_pretrained(path)
|
| 48 |
|