34 comments
103 forecasters
What will be the state-of-the-art language modelling performance (in perplexity) on WikiText-103 by the following dates?
This question is closed for predictions, and is waiting to be resolved
Authors:
Opened:Dec 14, 2020
Closed:Feb 13, 2025
Scheduled resolution:Dec 13, 2026
In the following years, what will be the highest LLM scores on the GPQA Diamond benchmark?
21 forecasters
What will the be the state-of-the-art performance on image classification on ImageNet in top-1 accuracy on the following dates?
78 forecasters
When will a language model be developed that, when tested, yields approximately human-level output?
05 Jun 2024
(05 May 2023 - 10 Feb 2027)
05 Jun 2024
(05 May 2023 - 10 Feb 2027)
36 forecasters