OctoThinker/OctoThinker-3B-Hybrid-Zero
Text Generation
⢠4B ⢠Updated ⢠6
⢠1
OctoThinker/OctoThinker-3B-Hybrid-Base
Text Generation
⢠3B ⢠Updated ⢠5.09k
⢠1
OctoThinker/OctoThinker-3B-Short-Zero
Text Generation
⢠4B ⢠Updated ⢠4
⢠1
OctoThinker/OctoThinker-3B-Short-Base
Text Generation
⢠3B ⢠Updated ⢠17
OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_100B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_megamath_web_pro_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/Llama_32_3B_finemath_4p_bs4M_seq8k_20B
Text Generation
⢠Updated OctoThinker/OctoThinker-3B-Long-Zero
Text Generation
⢠4B ⢠Updated ⢠10
OctoThinker/OctoThinker-1B-Short-Zero
Text Generation
⢠1B ⢠Updated ⢠11
OctoThinker/OctoThinker-1B-Hybrid-Zero
Text Generation
⢠1B ⢠Updated ⢠1
OctoThinker/OctoThinker-1B-Long-Zero
Text Generation
⢠1B ⢠Updated ⢠7
OctoThinker/OctoThinker-3B-Long-Base
Text Generation
⢠3B ⢠Updated ⢠7
⢠1
OctoThinker/OctoThinker-1B-Short-Base
Text Generation
⢠1B ⢠Updated ⢠22
OctoThinker/OctoThinker-1B-Hybrid-Base
Text Generation
⢠1B ⢠Updated ⢠71
⢠1
OctoThinker/OctoThinker-1B-Long-Base
Text Generation
⢠1B ⢠Updated ⢠3
OctoThinker/OctoThinker-8B-Short-Base
Text Generation
⢠8B ⢠Updated ⢠5
⢠1
OctoThinker/OctoThinker-8B-Hybrid-Base
Text Generation
⢠8B ⢠Updated ⢠13.2k
⢠2
OctoThinker/OctoThinker-8B-Long-Base
Text Generation
⢠8B ⢠Updated ⢠2
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq16k_20B
Updated
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B
Updated
OctoThinker/Llama3.2-3B-Zero
4B ⢠Updated