Group
What will be state-of-the-art performance on the MATH dataset on the following dates?
Make a Prediction
Date | resolution | ||
---|---|---|---|
69.6 | |||
- | |||
Date | 25th | median | 75th |
94.7 ... | 97.2 ... | 98.8 ... |
CDF
lower 25%
median
upper 75%
70.71
76.3
79.53
What was the final result?69.6
Community Baseline Score
7.1
Community Peer Score
6.7
Forecast Timeline
Authors:
Opened:
Closes:
Scheduled resolution:
Learn more about Metaculus NewsMatch
What will be state-of-the-art accuracy on the Massive Multitask dataset on the following dates?
94.6
What will be the best performance on FrontierMath by December 31st 2025?
43.4
What will the be the state-of-the-art performance on image classification on ImageNet in top-1 accuracy on the following dates?
91.9
Comments
? comments
Authors:
Opened:
Closes:
Scheduled resolution:
Learn more about Metaculus NewsMatch
What will be state-of-the-art accuracy on the Massive Multitask dataset on the following dates?
94.6
What will be the best performance on FrontierMath by December 31st 2025?
43.4
What will the be the state-of-the-art performance on image classification on ImageNet in top-1 accuracy on the following dates?
91.9