Contributed by the JetBrains community.
0 comments
4 forecasters
For these benchmarks, what percentage of problems do you estimate the top-performing AI model or agent will be able to solve by December'25?
AI2 Reasoning Challenge97.9
Toloka's µ-MATH94.5
Graduate-Level Google-Proof Q&A92.6
Authors:
Opened:Feb 18, 2025
Closes:Nov 30, 2025
Scheduled resolution:Dec 14, 2025