Contributed by Harvard's AI Safety Student Team community.
10 forecasters
What will be the best performance on SWE-bench Verified by December 31st 2025?
Latest estimate: 94.2
Authors:
Opened: Feb 9, 2025
Closed: Feb 17, 2025
Scheduled resolution: Jan 1, 2026
Spot Scoring Time: Feb 17, 2025