ajagota71/toxicity-reward-model-output-max-margin-1-seed-400-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 6
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-400-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-400-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 29
ajagota71/toxicity-reward-model-output-max-margin-10-seed-300-unfrozen-layers-0-pythia-70m Updated May 15 • 53
ajagota71/toxicity-reward-model-output-max-margin-10-seed-300-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 6
ajagota71/toxicity-reward-model-output-max-margin-5-seed-300-unfrozen-layers-0-pythia-70m Updated May 15 • 8
ajagota71/toxicity-reward-model-output-max-margin-5-seed-300-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 29
ajagota71/toxicity-reward-model-output-max-margin-1-seed-300-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-1-seed-300-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-300-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-300-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-10-seed-200-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-10-seed-200-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-5-seed-200-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-5-seed-200-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 41
ajagota71/toxicity-reward-model-output-max-margin-1-seed-200-unfrozen-layers-0-pythia-70m Updated May 15 • 5
ajagota71/toxicity-reward-model-output-max-margin-1-seed-200-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-200-unfrozen-layers-0-pythia-70m Updated May 15 • 25
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-200-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 25
ajagota71/toxicity-reward-model-output-max-margin-10-seed-100-unfrozen-layers-0-pythia-70m Updated May 15 • 20
ajagota71/toxicity-reward-model-output-max-margin-10-seed-100-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-5-seed-100-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-5-seed-100-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-70m Updated May 15 • 7
ajagota71/toxicity-reward-model-output-max-margin-1-seed-100-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 41
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-70m Updated May 15 • 83
ajagota71/toxicity-reward-model-output-max-margin-0.1-seed-100-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 30
ajagota71/toxicity-reward-model-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-70m Updated May 15 • 6
ajagota71/toxicity-reward-model-output-max-margin-10-seed-42-unfrozen-layers-0-pythia-70m-checkpoint-30 Updated May 15 • 41
ajagota71/toxicity-reward-model-output-max-margin-5-seed-42-unfrozen-layers-0-pythia-70m Updated May 15 • 7