akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SelfCompress_SFT Text Generation • Updated about 6 hours ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SelfCompress_SFT Text Generation • Updated about 6 hours ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • Updated about 20 hours ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • Updated about 20 hours ago
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • Updated 2 days ago • 27
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_14k Text Generation • Updated 2 days ago • 27