Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-fhm600-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 10 • 3
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken-new Text Generation • 3B • Updated Mar 5 • 3
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192-addthinktoken Text Generation • 3B • Updated Mar 5 • 2
Lansechen/OLMoE-1B-7B-012-Distill-or-math220k-batch32-epoch3-8192 Text Generation • 7B • Updated Mar 5 • 4
Lansechen/Qwen2.5-3B-Instruct-Distill-bs17k-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 4 • 2
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch3-8192 Text Generation • 7B • Updated Mar 4 • 3
Lansechen/OLMoE-1B-7B-0125-Distill-or-math220k-batch32-epoch1-8192 Text Generation • 7B • Updated Mar 1 • 3
Lansechen/OLMoE-1B-7B-0125-Distill-ot114k-batch32-epoch1-8192 Text Generation • 7B • Updated Feb 28 • 4
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch5-8192 Text Generation • 7B • Updated Feb 27 • 3
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-or-math220k-batch32 Text Generation • 7B • Updated Feb 27 • 2
Lansechen/OLMoE-1B-7B-0125-Distill-bs17k-batch32-epoch1-8192 Text Generation • 7B • Updated Feb 27 • 10 • 1
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5-8192 Text Generation • 7B • Updated Feb 27 • 4
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch1-8192 Text Generation • 7B • Updated Feb 26 • 2
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-ot114k-batch32 Text Generation • 7B • Updated Feb 23 • 9 • 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32 Text Generation • 16B • Updated Feb 22 • 5
Lansechen/OLMoE-1B-7B-0125-Instruct-Distill-bs17k-batch32-epoch5 Text Generation • 7B • Updated Feb 19 • 3
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch16-lora-numinamath Text Generation • Updated Feb 14 • 4 • 1
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-batch8-numinamath Text Generation • 16B • Updated Feb 13 • 5 • 1