In what year will AI achieve a score of 85% or higher on the SimpleBench benchmark?
Closes: March 26, 2032

Current market probabilities:

  • 2025 – 43%
  • 2026 – 23%
  • 2027 – 7%
  • 2028 – 7%
  • 2029 – 7%
  • 2030 – 8%
  • 2031 – 7%
  • 2032 –
  • 2033 – 25%

Background:

SimpleBench is a 200‑item multiple‑choice test designed to probe everyday human reasoning that still eludes frontier LLMs, including spatio‑temporal reasoning, social intelligence, and linguistic “trick” questions. Unlike most other benchmarks, humans still outperform AI on SimpleBench.

State of play:

• Human reference accuracy: 83.7%

• 2025 AI accuracy (Gemini 2.5 Pro): 62.4%
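
The distance to the milestone follows directly from the figures above (a minimal arithmetic sketch; the variable names are illustrative):

```python
# Gap between current best AI score, the human reference, and the 85% bar.
human = 83.7       # human reference accuracy (%)
ai = 62.4          # 2025 AI accuracy, Gemini 2.5 Pro (%)
threshold = 85.0   # resolution threshold (%)

print(f"AI trails humans by {human - ai:.1f} points")         # 21.3
print(f"AI needs {threshold - ai:.1f} more points to resolve")  # 22.6
```

So the threshold sits slightly above the human reference score: an AI that merely matches humans would not resolve this market.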

Why this milestone matters:

  • Everyday reasoning: Passing SimpleBench would indicate that LLMs can handle commonsense scenarios that remain brittle today.

  • Benchmark head‑room: Unlike MMLU, SimpleBench has not yet been “solved”, so it remains a useful yardstick for comparing AI progress against humans.

Resolution Criteria:

This market resolves to the year in which a fully automated AI system first achieves ≥ 85% average accuracy on SimpleBench (ALL metric), subject to all of the following:

  • Verification – The claim must be confirmed by either

    1. a peer‑reviewed paper on arXiv, or

    2. a public leaderboard entry on the official SimpleBench website or another credible source.

  • Compute resources – Unlimited.
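
The accuracy check the criteria describe can be sketched as follows (a hypothetical helper, not the official SimpleBench scoring script; names and the example tally are illustrative):

```python
def resolves(per_item_correct: list[bool], threshold: float = 0.85) -> bool:
    """Return True if average accuracy over the item set meets the 85% bar."""
    accuracy = sum(per_item_correct) / len(per_item_correct)
    return accuracy >= threshold

# Example: 171 of the 200 items correct -> 85.5% accuracy, which clears the bar;
# 169 correct -> 84.5%, which does not.
print(resolves([True] * 171 + [False] * 29))  # True
print(resolves([True] * 169 + [False] * 31))  # False
```

On a 200‑item test each question is worth 0.5 percentage points, so the ≥ 85% bar corresponds to at least 170 correct answers.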

Fine Print:

If the resolution criteria are not satisfied by Jan 1, 2033, the market resolves to “Not Applicable.”

© Manifold Markets, Inc.