
Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
Text Generation • 2B • Updated • 1.01k • 1
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning