eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel Text Classification • 2B • Updated 16 days ago • 58 • 3
locuslab/mix_ift_v4-smollm2-1.7b-score0_mix_rephrased_from_beginning_metadata-600B 2B • Updated Feb 26 • 1
locuslab/ift_then_gsm-smollm2-1.7b-score0_mix_rephrased_from_beginning_metadata-600B 2B • Updated Feb 26 • 2
locuslab/mix_ift_v4-smollm2-1.7b-score0_baseline60p_then_mix_rephrase_with_refusal-600B 2B • Updated Mar 2 • 2
locuslab/ift_then_gsm-smollm2-1.7b-score0_baseline60p_then_mix_rephrase_with_refusal-600B 2B • Updated Mar 2 • 2
locuslab/base-smollm2-1.7b-score0_baseline60p_then_mix_rephrase_with_refusal_and_metadata_5p-600B Updated Mar 3 • 1
locuslab/base-smollm2-1.7b-score0_with_123rephrase_with_45refusal-600B-mbs8-gbs1024-03mar_step-00030000 Updated Mar 4 • 2
locuslab/base-smollm2-1.7b-score0_mix_rephrased_from_beginning-600B-mbs8-gbs1024-17feb_step-00030000 Updated Mar 4 • 2