build-small-hackathon/mind-of-tashi-selfplay
How to use build-small-hackathon/mind-of-tashi-mini-sft-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("kshitijthakkar/tracegenix-mini-sft-clean-3ep")
model = PeftModel.from_pretrained(base_model, "build-small-hackathon/mind-of-tashi-mini-sft-lora")

This is the 21.8 MB LoRA adapter behind mind-of-tashi-mini-sft (see that card for the fine-tune config and eval results). It uses r=32 / α=64 over the attention and MLP projections and was trained on the mind-of-tashi-selfplay corpus. Apply it with peft onto the base model, or use the merged repo directly.
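To make the r=32 / α=64 setting concrete, here is a minimal sketch of the arithmetic behind a LoRA adapter. It assumes the standard LoRA scaling of α/r (not the rank-stabilized variant), and the layer dimensions are hypothetical placeholders, not values read from the base model.

```python
# Illustrative LoRA arithmetic for r=32, alpha=64.
# Assumption: standard scaling alpha / r (not rank-stabilized LoRA).
# The layer sizes used below are hypothetical, not taken from the base model.


def lora_scaling(r: int, alpha: int) -> float:
    """Factor applied to the low-rank update B @ A before it is added to W."""
    return alpha / r


def lora_added_params(d_in: int, d_out: int, r: int) -> int:
    """Parameters added to one linear layer: A is (r x d_in), B is (d_out x r)."""
    return r * (d_in + d_out)


R, ALPHA = 32, 64
scaling = lora_scaling(R, ALPHA)

# Hypothetical 1024 x 1024 projection, chosen only for illustration.
example_params = lora_added_params(d_in=1024, d_out=1024, r=R)

print(f"scaling = {scaling}, added params for one layer = {example_params}")
```

Under these assumptions each adapted projection computes W + 2.0 · (B @ A), and the adapter size grows linearly with r and with the number of projections it covers.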
Base model
kshitijthakkar/tracegenix-mini-sft-clean-3ep