Contributed by the Harvard's AI Safety Student Team community.

0 comments
10 forecasters

What will be the best performance on SWE-bench Verified by December 31st 2025?

Latest estimate
94.2