---
base_model:
- microsoft/phi-4
tags:
- text-generation-inference
- transformers
- unsloth
- phi3
- phi
- trl
- sft
- qwq
- reasoning
- code
- chat
license: mit
language:
- en
pipeline_tag: text-generation
---

Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉

Unlike other Parm models we had to optimize out fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800**

NOTE: THIS MODEL IS NOT FINISHED YET!

# Uploaded  model

- **Developed by:** Pinkstack
- **License:** MIT
- **Finetuned from model :** microsoft/phi-4

This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.