fudayuan commited on
Commit
0a3b9e1
·
verified ·
1 Parent(s): 15f5e39

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -57
README.md CHANGED
@@ -6,17 +6,26 @@ language:
6
  - en
7
  pipeline_tag: text-generation
8
  library_name: transformers
 
 
 
 
 
 
 
9
  ---
10
 
 
11
 
 
12
 
13
- # OpenSWE: Efficient SWE Environment Synthesis at Scale
 
 
 
 
14
 
15
- <p align="center">
16
- <img src="asset/arxiv-logo.svg" alt="arXiv" width="16" height="16"> <a href="https://arxiv.org/abs/" target="_blank">Paper</a> &nbsp; | &nbsp;
17
- <a href="https://github.com/GAIR-NLP/OpenSWE" target="_blank">Code</a> &nbsp; | &nbsp;
18
- <a href="https://github.com/GAIR-NLP/OpenSWE" target="_blank">Environments & Scripts</a>
19
- </p>
20
 
21
  <p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
22
  </div>
@@ -78,52 +87,6 @@ This repository contains the official implementation of the OpenSWE pipeline—a
78
 
79
  Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
80
 
81
- ## Quick Start
82
-
83
- ### 1. Data schema
84
-
85
- Collect your dataset in the following schema:
86
-
87
- | Field | Type | Description |
88
- |-------|------|-------------|
89
- | `instance_id` | `str` | Unique identifier for the sample. |
90
- | `repo` | `str` | Full GitHub repo name (e.g., `psf/requests`). |
91
- | `base_commit` | `str` | SHA of the commit immediately before the PR's first change. |
92
- | `end_commit` | `str` | SHA of the final commit in the PR. |
93
- | `problem_statement` | `str` | Issue description or problem to solve. |
94
- | `patch` | `str` | Diff of changes to functional (non-test) code. |
95
- | `test_patch` | `str` | Diff of changes to the test suite. |
96
- | `language` | `str` | Primary programming language of the repo. |
97
-
98
- ### 2. (Recommended) Prepare system
99
-
100
- - Download all git repositories into a _repocache_ directory.
101
- - Build base Docker images with `scripts/prepare_baseimg.py`.
102
-
103
- ### 3. Apply patches for SWE-bench evaluation
104
-
105
- Before running evaluation, apply:
106
-
107
- - **swe-agent.patch** — for [SWE-agent/SWE-agent](https://github.com/SWE-agent/SWE-agent): adds `skip_fetch` and OpenSWE instance fields.
108
- - **swe-bench-fork.patch** — for [SWE-rebench/SWE-bench-fork](https://github.com/SWE-rebench/SWE-bench-fork): adds `eval_script` support and `OPENSWE_EXIT_CODE` grading.
109
-
110
- Replace `/path/to/openswe` with your OpenSWE repo root. On conflicts use `git apply --reject` and resolve `.rej` files. Apply each patch once per repo.
111
-
112
- ### 4. Configure and run
113
-
114
- Edit `examples/run.sh` (set `OPENSWE_ROOT`, `DATA_PATH`, `OUTPUT_DIR`, `SETUP_DIR`, `RESULT_DIR`, `DATA_PATH`, API keys, and `DOCKER_REPOSITORY`), then:
115
-
116
- ```bash
117
- bash examples/run.sh
118
- ```
119
-
120
- For multi-machine building, see [Parallel Task Execution System](./scripts/parallel).
121
-
122
- ## Troubleshooting
123
-
124
- - **Dataset missing**: Ensure your dataset JSONL exists at the path set in `DATA_PATH`; check schema matches the table above.
125
- - **Patch conflicts**: Resolve `.rej` files after `git apply --reject` for swe-agent and swe-bench-fork.
126
-
127
  ## Acknowledgement
128
 
129
  OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
@@ -137,10 +100,13 @@ This project is licensed under AGPL-3.0. See [LICENSE](./LICENSE) for details.
137
  If you find OpenSWE useful, please cite:
138
 
139
  ```bibtex
140
- @article{openswe2026,
141
- title={daVinci-Env: Open SWE Environment Synthesis at Scale},
142
- author={Dayuan Fu and Shenyu Wu and Yunze Wu and Zerui Peng and Yaxing Huang and Jie Sun and Ji Zeng and Mohan Jiang and Lin Zhang and Yukun Li and Jiarui Hu and Liming Liu and Jinlong Hou and Pengfei Liu},
143
- journal={arXiv preprint},
144
- year={2026}
 
 
 
145
  }
146
  ```
 
6
  - en
7
  pipeline_tag: text-generation
8
  library_name: transformers
9
+ base_model:
10
+ - Qwen/Qwen2.5-72B
11
+ tags:
12
+ - software
13
+ - environment
14
+ - agent
15
+ - code
16
  ---
17
 
18
+ # OpenSWE: Efficient SWE Environment Synthesis at Scale
19
 
20
+ <div align="center">
21
 
22
+ [![Paper](https://img.shields.io/badge/Paper-PDF-1f6feb.svg)](https://github.com/GAIR-NLP/OpenSWE/blob/main/asset/paper.pdf)
23
+ [![arXiv](https://img.shields.io/badge/arXiv-2601.18418-b31b1b.svg)](https://arxiv.org/pdf/2603.13023)
24
+ [![GitHub](https://img.shields.io/badge/GitHub-Repository-green)](https://github.com/GAIR-NLP/OpenSWE)
25
+ [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Dataset-blue)](https://huggingface.co/datasets/GAIR/OpenSWE)
26
+ [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue)](https://huggingface.co/GAIR/OpenSWE-72B)
27
 
28
+ </div>
 
 
 
 
29
 
30
  <p align="center"> <img src="asset/teaser.png" style="width: 93%;" id="title-icon"> </p>
31
  </div>
 
87
 
88
  Training on OpenSWE alone yields large improvements over SWE-rebench across all model sizes and scaffolds; combining with SWE-rebench further improves 72B (e.g., 68.0% SWE-Agent). Data scaling analysis shows log-linear improvement with no saturation (see paper for curves). General capability evaluation shows gains on code (e.g., HumanEval +29), math (e.g., MATH-500 +12.2 for 72B), and science benchmarks without degrading factual recall.
89
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
90
  ## Acknowledgement
91
 
92
  OpenSWE is inspired by [SWE-Rebench](https://arxiv.org/abs/2505.20411) and [SWE-Factory](https://arxiv.org/abs/2506.10954). We thank these teams for their open-source contributions.
 
100
  If you find OpenSWE useful, please cite:
101
 
102
  ```bibtex
103
+ @misc{fu2026davincienvopensweenvironment,
104
+ title={daVinci-Env: Open SWE Environment Synthesis at Scale},
105
+ author={Dayuan Fu and Shenyu Wu and Yunze Wu and Zerui Peng and Yaxing Huang and Jie Sun and Ji Zeng and Mohan Jiang and Lin Zhang and Yukun Li and Jiarui Hu and Liming Liu and Jinlong Hou and Pengfei Liu},
106
+ year={2026},
107
+ eprint={2603.13023},
108
+ archivePrefix={arXiv},
109
+ primaryClass={cs.SE},
110
+ url={https://arxiv.org/abs/2603.13023},
111
  }
112
  ```