hbXNov/LLaDA-8B-Instruct-mlp2x_gelu-pretrain_blip558_v4-cont_200k_openllavanext_allava_gpt4omini 8B • Updated Apr 17 • 49
hbXNov/llama_8b_instruct_distill_r1_q1p5b_balanced_train_e3_lr5e-7_all-ckpt_3278 8B • Updated Mar 3 • 9
hbXNov/llama_8b_instruct_distill_r1_q1p5b_balanced_train_e6_lr5e-7_balanced_ckpt-4386 8B • Updated Mar 2 • 10
hbXNov/qwen_1p5b_base_distill_r1_q1p5b_balanced_train_e3_lr1e-5_balanced_ckpt_2193 2B • Updated Mar 2 • 10
hbXNov/llama_8b_instruct_distill_r1_q1p5b_balanced_train_e3_lr5e-7_balanced_ckpt_2193 8B • Updated Mar 1 • 9
hbXNov/llama_8b_instruct_distill_r1_q1p5b_balanced_train_e3_lr1e-5_balanced_ckpt_2193 8B • Updated Mar 1 • 12
hbXNov/qwen_1p5b_instruct_distill_r1_q1p5b_train_e3_lr1e-5_balanced-ckpt-4383 2B • Updated Feb 27 • 6
hbXNov/qwen_2p5_1p5b_instruct_distill_qwen_1p5b_gpt_4o_verify_1e-5_3072_e6-checkpoint-7536-merged 2B • Updated Jan 29 • 8
hbXNov/qwen_2p5_1p5b_instruct_distill_qwen_1p5b_gpt_4o_verify_5e-7_3072_merged 2B • Updated Jan 29 • 9