argilla/ultrafeedback-binarized-preferences-cleaned
Viewer • Updated • 60.9k • 14.4k • 162
How to use StefanKrsteski/Phi-3-mini-4k-instruct-DPO-EPFL with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
model = PeftModel.from_pretrained(base_model, "StefanKrsteski/Phi-3-mini-4k-instruct-DPO-EPFL")training_args and trainer configuration)eval_loss, with the aim to minimize this value.[Include references and citations for datasets, tools, and methodologies used.]
PEFT 0.11.1
Base model
microsoft/Phi-3-mini-4k-instruct