ZHANGYUXUAN-zR nielsr HF Staff commited on
Commit
0ab7173
·
1 Parent(s): 73b81c6

Update SWE-Bench Verified results (#65)

Browse files

- Update SWE-Bench Verified results (01e89f39dc384f2de5e2b6ee62ab9b6ab4e9d96b)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

.eval_results/swe_bench_verified.yaml CHANGED
@@ -6,4 +6,14 @@
6
  url: https://www.swebench.com/
7
  name: SWE-Bench official evaluation
8
  user: nielsr
9
- notes: high reasoning
 
 
 
 
 
 
 
 
 
 
 
6
  url: https://www.swebench.com/
7
  name: SWE-Bench official evaluation
8
  user: nielsr
9
+ notes: high reasoning, official
10
+
11
+ - dataset:
12
+ id: SWE-bench/SWE-bench_Verified
13
+ task_id: swe_bench_%_resolved
14
+ value: 77.8
15
+ source:
16
+ url: https://huggingface.co/zai-org/GLM-5/
17
+ name: Model card
18
+ user: nielsr
19
+ notes: Z.ai reported number