aisi-whitebox-red-team/hovik_bigcodebench_weak_model_llama_31_8b_instruct_bigcodebench Viewer • Updated Jul 22 • 2.82k • 10
aisi-whitebox-red-team/hovik_bigcodebench_weak_model_llama_31_8b_instruct_bigcodebench Viewer • Updated Jul 22 • 2.82k • 10
aisi-whitebox-red-team/bigcodebench_sandbagging_llama_distillation Viewer • Updated Jul 21 • 8.46k • 7
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench_sandbagging Viewer • Updated Jul 21 • 1.35k • 5
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench_sandbagging Viewer • Updated Jul 21 • 1.35k • 5
aisi-whitebox-red-team/hovik_bigcodebench_sandbagging_llama_31_8b_instruct_bigcodebench Viewer • Updated Jul 21 • 1.48k • 13
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench_benign Viewer • Updated Jul 21 • 1.09k • 8
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench_benign Viewer • Updated Jul 21 • 1.09k • 8
aisi-whitebox-red-team/hovik_bigcodebench_sandbagging_llama_31_8b_instruct_bigcodebench Viewer • Updated Jul 21 • 1.48k • 13
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench Viewer • Updated Jul 21 • 3.76k • 38
aisi-whitebox-red-team/hovik_bigcodebench_more_sandbagging_llama_33_70b_instruct_bigcodebench Viewer • Updated Jul 21 • 3.76k • 38
aisi-whitebox-red-team/hovik_bigcodebench_llama_33_70b_instruct_bigcodebench Viewer • Updated Jul 21 • 3.76k • 91
aisi-whitebox-red-team/hovik_bigcodebench_llama_33_70b_instruct_bigcodebench Viewer • Updated Jul 21 • 3.76k • 91