daichira/structured-hard-sft-4k
How to use shinich001/qwen3-4b-h100-v5-hard-ep3 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit")
model = PeftModel.from_pretrained(base_model, "shinich001/qwen3-4b-h100-v5-hard-ep3")

Top-ranker strategy model. Trained on an H100 with a blend of three datasets (approx. 14k rows), heavily preprocessed with a custom clean_assistant_output_v2 step (CoT stripping, markdown removal, TOML comment removal).
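The card does not show clean_assistant_output_v2 itself, so the following is only a rough sketch of what the three named cleaning steps could look like. Everything in it is an assumption made for illustration: the function name clean_assistant_output_sketch, the fmt parameter, the use of <think>...</think> tags to mark chain-of-thought, and the choice to treat "markdown removal" as dropping code-fence marker lines.

```python
import re

# Hypothetical stand-in for the card's clean_assistant_output_v2.
# The real implementation is not published in the source; names and rules
# below are assumptions.
_THINK_BLOCK = re.compile(r"<think>.*?</think>", re.DOTALL)
_FENCE_LINE = re.compile(r"^\s*`{3}[\w+-]*\s*$")  # a line holding only a fence marker
_COMMENT_LINE = re.compile(r"^\s*#.*$")           # a whole-line TOML comment


def clean_assistant_output_sketch(text: str, fmt: str = "") -> str:
    """Strip reasoning, fence markers, and (for TOML) whole-line comments."""
    # 1. CoT stripping: assumes reasoning is wrapped in <think>...</think>.
    text = _THINK_BLOCK.sub("", text)

    kept = []
    for line in text.splitlines():
        # 2. Markdown removal: drop fence marker lines, keep their contents.
        if _FENCE_LINE.match(line):
            continue
        # 3. TOML comment removal: whole-line comments only. Inline comments
        #    are left alone because '#' can legally appear inside strings.
        if fmt == "toml" and _COMMENT_LINE.match(line):
            continue
        kept.append(line)
    return "\n".join(kept).strip()


if __name__ == "__main__":
    fence = "`" * 3
    raw = (
        "<think>plan the table</think>\n"
        f"{fence}toml\n# server settings\n[server]\nport = 8080\n{fence}"
    )
    print(clean_assistant_output_sketch(raw, fmt="toml"))
```

Restricting comment removal to fmt == "toml" keeps lines starting with '#' intact in other output formats, where they may carry meaning (for example, Markdown headings or YAML comments the target should preserve).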
Base model
Qwen/Qwen3-4B-Instruct-2507