OurBigBook.com $£ Sponsor 中国独裁统治 China Dictatorship 新疆改造中心、六四事件、法轮功、郝海东、709大抓捕、2015巴拿马文件邓家贵、低端人口、西藏骚乱

SWE-bench (2024)

www.swebench.com/

By Princeton people.

This one aims to solve GitHub issues. It appears to contain 2,294 real-world GitHub issues and their corresponding pull requests.

Evaluation is simply based on "does the pull request make some pre-written failing test cases pass".

The dataset appears to be at: huggingface.co/datasets/princeton-nlp/SWE-bench in Parquet format.

 Ancestors (9)