Metaculus
M
Questions
Tournaments
Leaderboards
News
More
create
Log in
a
/
文
Feed Home
🤖🔭
AI Benchmarking
Topics
✨🔝
Top Questions
🗽🗳️
US Election Stakes
🕊️🌐
Global Elections
🇮🇱🇵🇸
Gaza Conflict
⏳🌀
5 Years After AGI
🦠🩺
Mpox outbreak
🇺🇦⚔️
Ukraine Conflict
🐦🦠
H5N1 Bird Flu
categories
🤖
Artificial Intelligence
🧬
Health & Pandemics
🌎
Environment & Climate
☣️
Nuclear Technology & Risks
See all categories
Hot
Movers
New
More
Filter
On January 1, 2025, which frontier AI lab will have a publicly available model with the highest score on the MMLU benchmark?
OpenAI
46.28%
Anthropic
29.45%
Google DeepMind
19.56%
3 others
9
12 comments
12
Understanding AI With Timothy B. Lee
Will AI capabilities plateau with the next generation (GPT-5, etc.) of language models?
Google DeepMind / Gemini 2 Ultra
15%
Anthropic / Claude 4 Opus
12%
OpenAI / GPT-5
7%
20
33 comments
33
Will the listed AI companies/labs pause any of their models before January 1st, 2025 because they detect dangerous capabilities?
OpenAI
1%
Google DeepMind
1%
Anthropic
1%
12
15 comments
15
AI Safety
What type of research group will develop the first transformative AI?
For-profit
80%
Government
10%
Non-profit
7%
3 others
17
7 comments
7
Situational Awareness
Will the listed AI companies/labs practice implementing emergency response plans before January 1st, 2025?
OpenAI
4%
Google DeepMind
1%
Anthropic
1%
5
4 comments
4
AI Safety
Will the listed AI companies/labs create a board risk committee before January 1st, 2025?
OpenAI
1%
Google DeepMind
1%
Anthropic
1%
3
no comments
0
AI Safety
Will the listed AI companies/labs have merge and assist clauses on January 1st 2025?
OpenAI
95%
Anthropic
5%
Google DeepMind
3%
4
2 comments
2
AI Safety
condition
3 AI labs constrain capabilities before 2026?
12 forecasters
if yes
if no
Date of Artificial General Intelligence
2030-03-07
Date of Artificial General Intelligence
2030-03-11
7
no comments
0
condition
LLM passes ARA before 2025?
28 forecasters
if yes
if no
LLM training paused for dangerous capability? (OpenAI)
5%
5%
LLM training paused for dangerous capability? (OpenAI)
1%
1%
6
1 comment
1
Conditional Cup
Will the highest Elo LLM on Chatbot Arena be non-proprietary during 2024?
0.1%
39 forecasters
8
9 comments
9
Load More