ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-20251217_step_10000 2B • Updated 6 days ago • 9
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.01-zw1_1_1-20251216_step_10000 2B • Updated 6 days ago • 12
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0-zw1_1_1-20251216_step_10000 2B • Updated 6 days ago • 8
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-minspan2-20251217_step_10000 2B • Updated 6 days ago • 9
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-longest_only-20251218_step_10000 2B • Updated 6 days ago • 12
ymh233/Focal_pretrain-Qwen-Qwen2.5-Coder-1.5B-NTE-d512-w0.1-zw1_1_1-minspan2-20251217_step_19000 2B • Updated 6 days ago • 13
ymh233/Focal_pretrain-Llama-3.2-1B-NTE-d512-w0.1-longest_only-20251218_step_8000 1B • Updated 7 days ago • 8
ymh233/Focal_nvidia-OpenReasoning-Nemotron-1.5B-nemotron_code_lf_filtered-train_prompt-lr1e-04-ep3 2B • Updated 14 days ago • 12
ymh233/NTP-Qwen2p5-7B-autoprogrammer_nemotron_code_lf_filtered-train_prompt-lr1e-05-ep4 Updated 14 days ago