simonycl/octothinker-8b-hybrid-zero-cold-start-sft-step-5 • Text Generation • 8B • Updated 10 days ago • 359
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-462 • Text Generation • 3B • Updated 15 days ago • 10
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-400 • Text Generation • 3B • Updated 15 days ago • 17
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-300 • Text Generation • 3B • Updated 15 days ago • 17
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-200 • Text Generation • 3B • Updated 15 days ago • 17
simonycl/octothinker-3b-hybrid-base-qwq-sft-checkpoint-100 • Text Generation • 3B • Updated 15 days ago • 19
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_win_only • Text Generation • 4B • Updated Aug 5 • 9
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_60k_whole • Text Generation • 4B • Updated Aug 5 • 10
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_win_only • Text Generation • 4B • Updated Aug 5 • 10
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_30k_whole • Text Generation • 4B • Updated Aug 5 • 10
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_15k_win_only • Text Generation • 4B • Updated Aug 5 • 11
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_win_only • Text Generation • 4B • Updated Aug 5 • 9
simonycl/sft-qwen3-4b-game-eval-Qwen-QwQ-32B-vs-Qwen-QwQ-32B-20250805-game_eval_5k_whole • Text Generation • 4B • Updated Aug 5 • 9
simonycl/octothinker-3b-hybrid-zero-cold-start-step-5 • Text Generation • 3B • Updated Jul 23 • 244
simonycl/gemma_3_27b_cmv_hard_persuasion_judge_new • Image-Text-to-Text • 27B • Updated May 15 • 6