thomasjhuang's picture
Upload SFT warmstart checkpoint-1000 for Qwen2.5-0.5B trained on cognitive behavioral reasoning
6ffcc38 verified