Question
What will the state-of-the-art language modelling performance on WikiText-103 be on 2023-02-14 in perplexity, amongst models not trained on extra data?
Resolved:14.8Total Forecasters26
Community Prediction
15.5
(13.9 - 15.7)
Make a Prediction
CDF
Lower bound | community | My Prediction |
<7 | 1.0% | — |
Quartiles | ||
lower 25% | 13.92 | — |
median | 15.53 | — |
upper 75% | 15.74 | — |
What was the final result?14.8
Community Baseline Score
12.1
Community Peer Score
51.2
Authors:
Opened:Feb 14, 2021
Closes:Apr 14, 2021
Resolves:Feb 14, 2023
Spot Scoring Time:Feb 16, 2021
Authors:
Opened:Feb 14, 2021
Closes:Apr 14, 2021
Resolves:Feb 14, 2023
Spot Scoring Time:Feb 16, 2021