DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-mini-2025-08-07-20260114_222454 Viewer • Updated about 4 hours ago • 300
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Viewer • Updated about 7 hours ago • 339 • 3
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-mini-2025-08-07-20260114_203811 Viewer • Updated about 7 hours ago • 216 • 3
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_164534 Viewer • Updated about 8 hours ago • 195 • 6
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gemini-2.5-flash-20260114_175612 Viewer • Updated about 10 hours ago • 266 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf7b91126 Viewer • Updated about 11 hours ago • 305 • 6
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_152435 Viewer • Updated about 11 hours ago • 198 • 8
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_swebench-verified-random-100-folders Viewer • Updated about 12 hours ago • 300 • 10
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_terminal-bench-2.0 Viewer • Updated about 12 hours ago • 261 • 9