---
license: apache-2.0
datasets:
- FreedomIntelligence/medical-o1-reasoning-SFT
base_model:
- unsloth/DeepSeek-R1-Distill-Llama-8B
---
# 基于DeepSeek-R1-Distill-Llama-8B进行微调


## 数据集(FreedomIntelligence/medical-o1-reasoning-SFT数据集)
https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT

## colab地址：
https://colab.research.google.com/drive/1N0Sf9yn8Tjs5gMJv-rez-0hzxBUDK3xK?usp=sharing#scrollTo=WisaI4QOw07u