kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation • Updated 18 days ago • 24 • 1
kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation • Updated 18 days ago • 24 • 1
view post Post 2666 Anyone using AI and ML to help neurodivergent people? I'd love to hear what you're doing. See translation 2 replies · 👀 7 7 + Reply
kz919/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Cautious-TRL-0.18.0.dev Text Generation • Updated 18 days ago • 24 • 1