![](/_next/image?url=https%3A%2F%2Ffirebasestorage.googleapis.com%2Fv0%2Fb%2Fmantic-markets.appspot.com%2Fo%2Fdream%252FnUWztGHsN4.png%3Falt%3Dmedia%26token%3Dde9734a6-0be0-46f6-9f76-364ec6e0770b&w=3840&q=75)
HumanEval 90% #2: Will pass@1 performance on the HumanEval benchmark be >= 90% by 2025?
Mini
8
Ṁ444resolved Jun 10
Resolved
YES1D
1W
1M
ALL
Benchmark link: https://paperswithcode.com/sota/code-generation-on-humaneval
pass@1 means the model gets a single attempt.
Get Ṁ600 play money
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ20 | |
2 | Ṁ8 | |
3 | Ṁ2 | |
4 | Ṁ0 |
Related questions
Related questions
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench assistance, by 2025?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
Will >50% of the tasks in the WebArena benchmark be solved by EOY 2024?
62% chance
Will a LLM beat human experts on GPQA by Jan 1, 2025?
53% chance
Human-machine intelligence parity achieved before 2028
51% chance
Will AI pass Video Turing Test by 2030?
68% chance
By 2025, will most well-educated people expect AI to within 10 years be better at intellectual work than 99% of humans?
15% chance
Human-machine intelligence parity achieved before 2030
70% chance
Will effective personality simulation be available by 2030?
54% chance