ajagota71/gemma-3-270m-detox-checkpoint-epoch-100 Reinforcement Learning • 0.3B • Updated 24 days ago • 9
ajagota71/gemma-3-270m-detox-checkpoint-epoch-80 Reinforcement Learning • 0.3B • Updated 24 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-100 Reinforcement Learning • 0.5B • Updated 24 days ago • 8
ajagota71/gemma-3-270m-detox-checkpoint-epoch-60 Reinforcement Learning • 0.3B • Updated 24 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-80 Reinforcement Learning • 0.5B • Updated 24 days ago • 8
ajagota71/gemma-3-270m-detox-checkpoint-epoch-40 Reinforcement Learning • 0.3B • Updated 24 days ago • 8
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-60 Reinforcement Learning • 0.5B • Updated 24 days ago • 9
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-40 Reinforcement Learning • 0.5B • Updated 24 days ago • 9
ajagota71/gemma-3-270m-detox-checkpoint-epoch-20 Reinforcement Learning • 0.3B • Updated 24 days ago • 10
ajagota71/Qwen2.5-0.5B-detox-checkpoint-epoch-20 Reinforcement Learning • 0.5B • Updated 24 days ago • 9
ajagota71/SmolLM2-360M-detox-checkpoint-epoch-100 Reinforcement Learning • 0.4B • Updated 25 days ago • 8
ajagota71/SmolLM2-360M-detox-checkpoint-epoch-80 Reinforcement Learning • 0.4B • Updated 25 days ago • 9
ajagota71/SmolLM2-135M-detox-checkpoint-epoch-100 Reinforcement Learning • 0.1B • Updated 25 days ago • 7
ajagota71/SmolLM2-135M-detox-checkpoint-epoch-80 Reinforcement Learning • 0.1B • Updated 25 days ago • 9
ajagota71/SmolLM2-360M-detox-checkpoint-epoch-60 Reinforcement Learning • 0.4B • Updated 25 days ago • 9
ajagota71/SmolLM2-135M-detox-checkpoint-epoch-60 Reinforcement Learning • 0.1B • Updated 25 days ago • 9
ajagota71/SmolLM2-360M-detox-checkpoint-epoch-40 Reinforcement Learning • 0.4B • Updated 25 days ago • 11
ajagota71/SmolLM2-135M-detox-checkpoint-epoch-40 Reinforcement Learning • 0.1B • Updated 25 days ago • 9
ajagota71/SmolLM2-360M-detox-checkpoint-epoch-20 Reinforcement Learning • 0.4B • Updated 25 days ago • 10
ajagota71/SmolLM2-135M-detox-checkpoint-epoch-20 Reinforcement Learning • 0.1B • Updated 25 days ago • 10
ajagota71/SmolLM-360M-detox-checkpoint-epoch-100 Reinforcement Learning • 0.4B • Updated 25 days ago • 14
ajagota71/SmolLM-135M-detox-checkpoint-epoch-100 Reinforcement Learning • 0.1B • Updated 25 days ago • 15
ajagota71/SmolLM-360M-detox-checkpoint-epoch-80 Reinforcement Learning • 0.4B • Updated 25 days ago • 15
ajagota71/SmolLM-135M-detox-checkpoint-epoch-80 Reinforcement Learning • 0.1B • Updated 25 days ago • 15