DCAgent2/swebench_verified_random_100_folders_swesmith_sandboxes_with_tests_gpt_5_mini_p710cae67 Viewer • Updated about 2 hours ago • 300
DCAgent2/terminal_bench_2_glm46_Toolscale_tasks_traces_20260311_174322 Viewer • Updated about 2 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_ac6ef45e4 Viewer • Updated about 3 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_1e58fbf5 Viewer • Updated about 3 hours ago • 300
DCAgent2/terminal_bench_2_rl_r2egym_nl2bash_stack_bugsseq_fixthink_again_lr1e_5_postmort1bdb5755 Viewer • Updated about 4 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_ceabc985 Viewer • Updated about 5 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_114efe33 Viewer • Updated about 5 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3c2e552a9 Viewer • Updated about 6 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwena0e0c3f6 Viewer • Updated about 7 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwenb1c78a15 Viewer • Updated about 7 hours ago • 300