Metaculus
What will be the best perplexity score by a language model on the Penn Treebank (Word Level) by the end of 2024?
11.6
15 forecasters
5
3 comments
AI Technical Benchmarks
Best Penn Treebank perplexity of 2019?
Resolved: 50.1
33 forecasters
6
17 comments
Which language modelling benchmark will be most popular in the calendar year 2022?
Resolved: Ambiguous
30 forecasters
4
3 comments
When will a language model be developed that, when tested, yields approximately human-level output?
2024-06-05
36 forecasters
14
8 comments
AI Demonstrations
Human-Level Language Models
36
16 comments