1
17 forecasters

What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2030?

Current estimate
95.6%