Question

Best Penn Treebank perplexity of 2019?

Resolved:50.1

17 comments17

Total Forecasters33

Community Prediction

39.6

(35.5 - 44.3)

Make a Prediction

PDF

CDF

	<35	lower 25%	median	upper 75%
community	22.2%	35.51	39.6	44.28
My Prediction	—	—	—	—

Lower bound	community	My Prediction
<35	22.2%	—
Quartiles
lower 25%	35.51	—
median	39.6	—
upper 75%	44.28	—

What was the final result?50.1

Community Peer Score

22.1

Authors:

DanielFilan

Opened:

Jan 19, 2019

Closes:

Jan 1, 2020

Resolves:

Jan 12, 2020

Spot Scoring Time:

Jan 21, 2019

Computing and Math

Artificial Intelligence

2019 Leaderboard

When will a language model be developed that, when tested, yields approximately human-level output?

05 Jun 2024

What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2040?

98.2

What will be the best non-human SAT-style score on the hard subset of the QuALITY dataset by January 1, 2030?