Metaculus
Best Penn Treebank perplexity of 2019?
Resolved: 50.1
39.6
33 forecasters
6
17 comments
Which language modelling benchmark will be most popular in the calendar year 2022?
Resolved: Ambiguous
2.99
30 forecasters
4
3 comments
When will a language model be developed that, when tested, yields approximately human-level output?
2024-06-05
36 forecasters
14
8 comments
AI Demonstrations
What will be the best perplexity score by a language model on the Penn Treebank (Word Level) by the end of 2024?
Resolved: Annulled
19.9
18 forecasters
4
5 comments
AI Technical Benchmarks
Human-Level Language Models
36
17 comments