evanellis/codeforces_llama3.3_70B_human_with_Qwen3-14B_blind_null_agreement_thresh0.8_f • Updated Sep 3 • 3.27k • 60
evanellis/codeforces_llama3.3_70B_human_with_Qwen3-14B_blind_null_agreement_thresh0.8 • Updated Sep 3 • 3.27k • 52
evanellis/codeforces_gemma_3_27b_it_human_with_Qwen3-14B_blind_null_agreement_thresh0.8_f • Updated Sep 3 • 3.31k • 25
evanellis/codeforces_gemma_3_27b_it_human_with_Qwen3-14B_blind_null_agreement_thresh0.8 • Updated Sep 3 • 3.31k • 39
evanellis/codeforces_gemma_3_27b_it_human_with_Qwen3-8B_blind_null_agreement_thresh0.8_f • Updated Sep 3 • 3.31k • 26
evanellis/codeforces_gemma_3_27b_it_human_with_Qwen3-8B_blind_null_agreement_thresh0.8 • Updated Sep 3 • 3.31k • 27
evanellis/codeforces_gemma_3_27b_it_human_with_Llama-3.1-8B-Instruct_thresh0.8_f • Updated Sep 3 • 3.31k • 40
evanellis/codeforces_gemma_3_27b_it_human_with_Llama-3.1-8B-Instruct_thresh0.8 • Updated Sep 3 • 3.31k • 38
evanellis/codeforces_gemma_3_27b_it_human_with_Llama-3.1-8B-Instruct_eta0.5_f • Updated Sep 2 • 3.31k • 34
evanellis/codeforces_gemma_3_27b_it_human_with_Llama-3.1-8B-Instruct_eta0.5 • Updated Sep 2 • 3.31k • 38