Contributed by the Risk Threshold Forecasting community.
1 forecaster
When will an 8-hour, 80% reliability time horizon be achieved on METR’s Autonomy Tasks by a Gemini 2.5 Pro scale model by Google?
Current estimate: 18 Jul 2029
Authors:
Opened: Jun 25, 2025
Closes: Jan 1, 2041
Scheduled resolution: Jan 1, 2041
Spot Scoring Time: Jun 28, 2025