Contributed by the Risk Threshold Forecasting community.

When will Google first report that an AI system has reached or surpassed the following Machine Learning R&D risk levels?

Forecast revealed in 3 days

Contributed by the Risk Threshold Forecasting community.

When will OpenAI first report that an AI system has achieved the following risk levels on AI Self-improvement?

Forecast revealed in 3 days

Q1 AI Benchmark Results: Pro Forecasters Crush Bots

Q1 AI Forecasting Benchmark Tournament

Contributed by the Risk Threshold Forecasting community.

When will OpenAI first report that an AI system has achieved the following risk levels on Biological and Chemical?

Forecast revealed in 3 days

Contributed by the Risk Threshold Forecasting community.

When will Google first report that an AI system has reached or surpassed the following Cyber risk levels?

Forecast revealed in 3 days

Contributed by the Risk Threshold Forecasting community.

When will Google first report that an AI system has reached or surpassed the following Instrumental Reasoning risk levels?

Forecast revealed in 3 days

Contributed by the Risk Threshold Forecasting community.

When will Anthropic first report that an AI system has reached or surpassed the following AI R&D risk levels?

Forecast revealed in 4 days

Contributed by the Risk Threshold Forecasting community.

When will 80% accuracy be achieved on Cybench by a GPT-4.1 scale model by OpenAI?

Contributed by the Risk Threshold Forecasting community.

When will 80% accuracy be achieved on Cybench by a Claude Opus 4 scale model by Anthropic?

Contributed by the Risk Threshold Forecasting community.

When will 80% accuracy be achieved on Cybench by a Claude Sonnet 4 scale model by Anthropic?