Text Generation
PEFT
Safetensors
Transformers
English
lora
sft
trl
unsloth
How to use from
Unsloth Studio
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for haritzpuerto/unsloth-Phi-4-3.8B-IF-RT to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for haritzpuerto/unsloth-Phi-4-3.8B-IF-RT to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for haritzpuerto/unsloth-Phi-4-3.8B-IF-RT to start chatting
Load model with FastModel
pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="haritzpuerto/unsloth-Phi-4-3.8B-IF-RT",
    max_seq_length=2048,
)
Quick Links

Model Card

This model is a fine-tuned version of unsloth/Phi-4-mini-reasoning-unsloth-bnb-4bit.

Quick start

from transformers import pipeline

question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
generator = pipeline("text-generation", model="None", device="cuda")
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])

Training procedure

This model was trained with SFT.

Framework versions

  • PEFT 0.17.1
  • TRL: 0.23.0
  • Transformers: 4.57.1
  • Pytorch: 2.8.0
  • Datasets: 4.3.0
  • Tokenizers: 0.22.1

Citations

@misc{puerto2026controllablereasoningmodelsprivate,
      title={Controllable Reasoning Models Are Private Thinkers}, 
      author={Haritz Puerto and Haonan Li and Xudong Han and Timothy Baldwin and Iryna Gurevych},
      year={2026},
      eprint={2602.24210},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2602.24210}, 
}
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for haritzpuerto/unsloth-Phi-4-3.8B-IF-RT

Dataset used to train haritzpuerto/unsloth-Phi-4-3.8B-IF-RT

Collection including haritzpuerto/unsloth-Phi-4-3.8B-IF-RT

Paper for haritzpuerto/unsloth-Phi-4-3.8B-IF-RT