penfever/meta-llama_Llama-3.1-70B-Instruct-jdgfct-Conciseness Viewer • Updated 9 days ago • 985k • 57
penfever/meta-llama_Llama-3.1-70B-Instruct-jdgfct-Completeness Viewer • Updated 10 days ago • 985k • 23
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Harmlessness Viewer • Updated 12 days ago • 984k • 88
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Completeness Viewer • Updated 12 days ago • 984k • 94
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Conciseness Viewer • Updated 12 days ago • 984k • 89
penfever/meta-llama_Llama-3.1-8B-Instruct-jdgfct-Readability Viewer • Updated 12 days ago • 984k • 17
penfever/rl__64GPU_base_32b__nl2bash-tasks-cleaned-oracle__syh-r2eg-askl-glm_4__40-0 Updated 21 days ago • 25
penfever/rl__24GPU_shaped__stackexchange-overflow-sandboxes-skywork-response__exp_tas_optimal_comb__40-0 Viewer • Updated 24 days ago • 41.8k • 60
penfever/rl__24GPU_shaped__inferredbugs-sandboxes-verifier__exp_tas_optimal_comb__40-0 Viewer • Updated 30 days ago • 30.8k • 57
penfever/rl__64GPU_shaped_32b_entropy__swe_rebench_patched_oracle__syh-r2eg-askl-glm_4__40-0 Viewer • Updated 30 days ago • 8.51k • 38
penfever/rl__24GPU_shaped__nemotron-math-oracle-filtered__exp_tas_optimal_comb__40-0 Viewer • Updated 30 days ago • 22.4k • 42
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k-reward1 Viewer • Updated 30 days ago • 5.24k • 32
penfever/Kimi-2.5-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-32k Viewer • Updated 30 days ago • 9.36k • 44