Self-Taught Self-Correction for Small Language Models • Paper • 2503.08681 • Published Mar 11 • 15
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • 2B • Updated Feb 24 • 811k • 1.29k