ZHANGYUXUAN-zR SaylorTwift HF Staff commited on
Commit
73b81c6
·
1 Parent(s): fafc0ac

Add Terminal-Bench 2.0 evaluation result (52.4%) (#64)

Browse files

- Add Terminal-Bench 2.0 evaluation result (52.4%) (b84617d98a3b5b336983de136a971072ceea0929)


Co-authored-by: Nathan Habib <SaylorTwift@users.noreply.huggingface.co>

Files changed (1) hide show
  1. .eval_results/terminal_bench_2.yaml +10 -0
.eval_results/terminal_bench_2.yaml ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: harborframework/terminal-bench-2.0
3
+ task_id: terminalbench_2
4
+ value: 52.4
5
+ date: '2026-02-23'
6
+ source:
7
+ url: https://www.tbench.ai/leaderboard/terminal-bench/2.0
8
+ name: Terminal-Bench Leaderboard
9
+ user: SaylorTwift
10
+ notes: "agent: Terminus 2"