ajagota71/gemma-3-270m-detox-checkpoint-epoch-100 Reinforcement Learning • 0.3B • Updated 23 days ago • 9
ajagota71/gemma-3-270m-detox-checkpoint-epoch-80 Reinforcement Learning • 0.3B • Updated 23 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-100 Reinforcement Learning • 0.5B • Updated 23 days ago • 8
ajagota71/gemma-3-270m-detox-checkpoint-epoch-60 Reinforcement Learning • 0.3B • Updated 23 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-80 Reinforcement Learning • 0.5B • Updated 23 days ago • 8
ajagota71/gemma-3-270m-detox-checkpoint-epoch-40 Reinforcement Learning • 0.3B • Updated 23 days ago • 8
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-60 Reinforcement Learning • 0.5B • Updated 23 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-40 Reinforcement Learning • 0.5B • Updated 23 days ago • 9