Questions
Tournaments
Services
News
Questions
Tournaments
Questions
Questions
More
Log in
Sign Up
a
/
文
Log in
Sign Up
5
comments
18
forecasters
What will be the best perplexity score by a language model on the Penn Treebank (Word Level) by the end of 2024?
community
19.9
result
Annulled
Share
Comments
Timeline
Question Info
Timeline
1d
1w
2m
all
Resolution Criteria
Background Info
Follow
embed
Authors:
Matthew_Barnett
Opened:
Sep 24, 2021
Closes:
Jan 1, 2025
Resolved:
Jan 1, 2025
Spot Scoring Time:
Sep 26, 2021
AI Progress Essay Contest
AI Technical Benchmarks
Computing and Math
Artificial Intelligence
🏆 2021-2025 Leaderboard
Similar Questions
When will a language model be developed that, when tested, yields approximately human-level output?
05 Jun 2024
(05 May 2023 - 10 Feb 2027)
05 Jun 2024
(05 May 2023 - 10 Feb 2027)
36
forecasters
In the following years, what will be the highest LLM scores on the GPQA Diamond benchmark?
2024
87.7
2025
92.8
2026
98.1
1 other
21
forecasters
What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2030?
96.9%
(92.6 - 98.8)
96.9%
(92.6 - 98.8)
11
forecasters
Show More Questions