M

/Risk Threshold Forecasting

Questions

Contributed by the Risk Threshold Forecasting community.

Contributed by the Risk Threshold Forecasting community.

When will an 8 hour, 80% reliability time horizon be achieved on METR’s Autonomy Tasks by a Claude Sonnet 4 scale model by Anthropic?

Current estimate

Key Factors

No key factors yetAdd some that might influence this forecast.

Add key factor

No forecasts yet

Forecast Timeline

No key factors yetAdd some that might influence this forecast.

Add key factor

Authors:

Opened:

Jun 25, 2025

Closes:

Jan 1, 2041

Scheduled resolution:

Jan 1, 2041

Spot Scoring Time:

Jun 28, 2025

Risk Threshold Forecasting

Artificial Intelligence

Anthropic launches Claude Sonnet 4.5, its best AI model for coding

TechCrunch•Sep 29, 2025

Anthropic unveils latest AI model, aiming to extend its lead in coding intelligence

Business Insider•Sep 29, 2025

Anthropic releases Claude Sonnet 4.5 in latest bid for AI agents and coding supremacy

Verge•Sep 29, 2025

Learn more about Metaculus NewsMatch