metadata
license: apache-2.0
datasets:
- FreedomIntelligence/medical-o1-reasoning-SFT
base_model:
- unsloth/DeepSeek-R1-Distill-Llama-8B
基于DeepSeek-R1-Distill-Llama-8B进行微调
数据集(FreedomIntelligence/medical-o1-reasoning-SFT数据集)
https://huggingface.co/datasets/FreedomIntelligence/medical-o1-reasoning-SFT