36 forecasters

What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2025?

Community prediction: 80.6
Result: 69.7