---
license: mit
license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:
- en
pipeline_tag: text-generation
tags:
- phi
- nlp
- math
- code
- chat
- conversational
- phi3
inference:
  parameters:
    temperature: 0
widget:
- messages:
  - role: user
    content: How many R's in strawberry? Think step by step. 
library_name: transformers
---

gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf

[Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)
Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉

Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**

NOTE: more information soon, gguf

# Uploaded  model

- **Developed by:** Pinkstack
- **License:** MIT
- **Finetuned from model :** microsoft/phi-4

This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.