GGUF
llama
conversational
deepseek-r1-medical / README.md
fzkun's picture
Create README.md
f60534d verified
metadata
license: apache-2.0
datasets:
  - FreedomIntelligence/medical-o1-reasoning-SFT
base_model:
  - unsloth/DeepSeek-R1-Distill-Llama-8B

基于DeepSeek-R1-Distill-Llama-8B进行微调

数据集(FreedomIntelligence/medical-o1-reasoning-SFT数据集)

https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT

colab地址:

https://colab.research.google.com/drive/1N0Sf9yn8Tjs5gMJv-rez-0hzxBUDK3xK?usp=sharing#scrollTo=WisaI4QOw07u