--- license: apache-2.0 datasets: - FreedomIntelligence/medical-o1-reasoning-SFT base_model: - unsloth/DeepSeek-R1-Distill-Llama-8B --- # 基于DeepSeek-R1-Distill-Llama-8B进行微调 ## 数据集(FreedomIntelligence/medical-o1-reasoning-SFT数据集) https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT ## colab地址: https://colab.research.google.com/drive/1N0Sf9yn8Tjs5gMJv-rez-0hzxBUDK3xK?usp=sharing#scrollTo=WisaI4QOw07u