File size: 1,015 Bytes
81554fc
25ddc0a
5e0d928
81554fc
 
25ddc0a
5e0d928
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
81554fc
 
d9fe9bd
 
5e0d928
f4ee896
25ddc0a
736b696
25ddc0a
b8979ff
f4ee896
81554fc
 
 
25ddc0a
f4ee896
81554fc
25ddc0a
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
---
license: mit
license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:
- en
pipeline_tag: text-generation
tags:
- phi
- nlp
- math
- code
- chat
- conversational
- phi3
inference:
  parameters:
    temperature: 0
widget:
- messages:
  - role: user
    content: How many R's in strawberry? Think step by step. 
library_name: transformers
---

gguf/final version: https://huggingface.co/Pinkstack/PARM-V2-phi-4-16k-CoT-o1-gguf

[Phi-4 Technical Report](https://arxiv.org/pdf/2412.08905)
Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉

Unlike other Parm models we had to optimize our fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**

NOTE: more information soon, gguf

# Uploaded  model

- **Developed by:** Pinkstack
- **License:** MIT
- **Finetuned from model :** microsoft/phi-4

This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.