HumanEval 90% #1: Will pass@1 performance on the HumanEval benchmark be >= 90% by 2024?
13
847
250
resolved Oct 22
Resolved
YES

Benchmark link: https://paperswithcode.com/sota/code-generation-on-humaneval

pass@1 means the model gets a single attempt.

Get Ṁ200 play money

🏅 Top traders

#NameTotal profit
1Ṁ396
2Ṁ268
3Ṁ56
4Ṁ30
5Ṁ21
Sort by:
bought Ṁ100 of YES

Unless I've misunderstood what this market is asking, it seems like the answer is yes. At the linked benchmark page, the top model is currently sitting at 94.4%

@js Thank you for pointing this out

More related questions