HumanEval 90% #3: Will pass@1 performance on the HumanEval benchmark be >= 90% by 2026?

8

Ṁ190Ṁ367

resolved Jun 10

Resolved

YES

1H

6H

1D

1W

1M

ALL

Benchmark link: https://paperswithcode.com/sota/code-generation-on-humaneval

pass@1 means the model gets a single attempt.

Market context

Technical AI Timelines

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ35
2		Ṁ21
3		Ṁ11
4		Ṁ4
5		Ṁ2

Sort by:

GPT-4 is 82% zero shot https://arxiv.org/pdf/2303.12712.pdf#page=21

People are also trading

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?

Top score on Humanity's Last Exam > 50% by 2027?

Top score on Humanity's Last Exam > 50% by 2028?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

Top score on Humanity's Last Exam > 50% by 2029?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?

Related questions

Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?

Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?

Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2030?

Top score on Humanity's Last Exam > 50% by 2027?

Top score on Humanity's Last Exam > 50% by 2028?

Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?

Top score on Humanity's Last Exam > 50% by 2029?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2030?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?

© Manifold Markets, Inc.•Terms•Privacy