MANIFOLD

HumanEval 90% #1: Will pass@1 performance on the HumanEval benchmark be >= 90% by 2024?


resolved Oct 22

Resolved

YES


Benchmark link: https://paperswithcode.com/sota/code-generation-on-humaneval

pass@1 means the model gets a single attempt.
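As a rough illustration of how a pass@1 score could be checked against the 90% threshold, here is a minimal Python sketch. It assumes the commonly used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), computed per problem and averaged. The function names and the threshold helper are hypothetical, and the leaderboard linked above may compute its numbers differently.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one problem from n samples, c of which pass the tests."""
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    # math.comb returns 0 when fewer than k samples failed, giving an estimate of 1.0.
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_1(per_problem: list[tuple[int, int]]) -> float:
    """Average pass@1 over problems; each entry is (samples drawn, samples correct)."""
    return sum(pass_at_k(n, c, 1) for n, c in per_problem) / len(per_problem)


def meets_threshold(score: float, threshold: float = 0.90) -> bool:
    """Hypothetical helper mirroring the market's '>= 90%' condition."""
    return score >= threshold
```

With a single attempt per problem (n = 1), pass@1 reduces to the fraction of problems solved, so a reported score of 94.4% would satisfy `meets_threshold(0.944)`.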

Market context

Technical AI Timelines


🏅 Top traders

#	Trader	Total profit
1		Ṁ396
2		Ṁ268
3		Ṁ56
4		Ṁ30
5		Ṁ21

2 Comments

11 Holders

50 Trades


Unless I've misunderstood what this market is asking, it seems like the answer is yes. On the linked benchmark page, the top model is currently sitting at 94.4%.

@js Thank you for pointing this out

People are also trading

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

45% chance (+3% over 1d)

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

35% chance

Will AI achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?

84% chance

Top score on Humanity's Last Exam > 50% by 2027?

98% chance

Top score on Humanity's Last Exam > 50% by 2028?

98% chance

Top score on Humanity's Last Exam > 50% by 2029?

99% chance

What will be the best performance on EnigmaEval by December 31st 2026?

Will AI achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?

33% chance

Will AI achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?

7% chance

In what year will AI achieve a score of 95% or higher on the PhysBench leaderboard?

2036
