license: apache-2.0 | |
datasets: | |
- FreedomIntelligence/medical-o1-reasoning-SFT | |
base_model: | |
- unsloth/DeepSeek-R1-Distill-Llama-8B | |
# 基于DeepSeek-R1-Distill-Llama-8B进行微调 | |
## 数据集(FreedomIntelligence/medical-o1-reasoning-SFT数据集) | |
https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT | |
## colab地址: | |
https://colab.research.google.com/drive/1N0Sf9yn8Tjs5gMJv-rez-0hzxBUDK3xK?usp=sharing#scrollTo=WisaI4QOw07u |