Contributed by the Risk Threshold Forecasting community.
Question
When will an 8 hour, 80% reliability time horizon be achieved on METR’s Autonomy Tasks by a Gemini 2.5 Pro scale model by Google?
Total Forecasters1
Community Prediction
18 Jul 2029
(04 Dec 2028 - 04 Mar 2030)
Make a Prediction
CDF
Quartiles | community | My Prediction |
lower 25% | 04 Dec 2028 | — |
median | 18 Jul 2029 | — |
upper 75% | 04 Mar 2030 | — |
Upper bound | ||
>Jun 2040 | 0.1% | — |
No key factors yetAdd some that might influence this forecast.
Add key factor
Authors:
Opened:Jun 25, 2025
Closes:Jan 1, 2041
Scheduled resolution:Jan 1, 2041
Spot Scoring Time:Jan 1, 2041
Authors:
Opened:Jun 25, 2025
Closes:Jan 1, 2041
Scheduled resolution:Jan 1, 2041
Spot Scoring Time:Jan 1, 2041