Metaculus
jacob.steinhardt
Predictions: 18
Comments: 13
Member Since: December 2015
Questions by jacob.steinhardt
What will be state-of-the-art accuracy on the Massive Multitask dataset on the following dates?
June 30, 2025: 94.6
June 30, 2024: 88.7
June 30, 2023: 86.4
12
38 comments
AI Technical Benchmarks
What will be state-of-the-art performance on the MATH dataset on the following dates?
June 30, 2025: 97.1
June 30, 2024: 91.1
June 30, 2023: 69.6
20
32 comments
AI Technical Benchmarks