--- license: mit license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE language: - en pipeline_tag: text-generation tags: - phi - nlp - math - code - chat - conversational - phi3 inference: parameters: temperature: 0 widget: - messages: - role: user content: How many R's in strawberry? Think step by step. library_name: transformers --- gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf [Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905) Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉 Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800** NOTE: more information soon, gguf # Uploaded model - **Developed by:** Pinkstack - **License:** MIT - **Finetuned from model :** microsoft/phi-4 This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.