7 comments
77 forecasters
What will be the best score in the 2019/2020 Winograd Schema AI challenge
Forecast Timeline
Authors:
Opened:Jun 16, 2018
Closes:May 1, 2020
Resolved:Dec 31, 2020
Spot Scoring Time:Jun 18, 2018
What will be the best score by an AI on the full Humanity's Last Exam (HLE) before 2026?
49.6%
(37.8 - 61.6)
49.6%
(37.8 - 61.6)
60 forecasters
What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2040?
99.1%
(96.3 - 99.7)
99.1%
(96.3 - 99.7)
22 forecasters
What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2030?
96.9%
(92.6 - 98.8)
96.9%
(92.6 - 98.8)
11 forecasters