--- base_model: - microsoft/phi-4 tags: - text-generation-inference - transformers - unsloth - phi3 - phi - trl - sft - qwq - reasoning - code - chat license: mit language: - en pipeline_tag: text-generation --- Phi-4 that has been tuned to be more advanced at reasoning. Parm magic 😉 Unlike other Parm models we had to optimize out fine tuning process to ensure accuracy while still being able to release this model. **Training loss: 0.443800** NOTE: THIS MODEL IS NOT FINISHED YET! # Uploaded model - **Developed by:** Pinkstack - **License:** MIT - **Finetuned from model :** microsoft/phi-4 This phi-4 model was trained with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.