HumanEval 90% #3: Will pass@1 performance on the HumanEval benchmark be >= 90% by 2026?
83% chance

Benchmark link: https://paperswithcode.com/sota/code-generation-on-humaneval

pass@1 means the model gets a single attempt per problem: it generates one solution, and that solution must pass the problem's unit tests.
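As a rough illustration of how a pass@1 score could be checked against the 90% threshold, here is a minimal sketch. It assumes one generated solution per problem and a list of pass/fail outcomes; the function names `pass_at_1` and `meets_threshold` and the example outcomes are made up for illustration and are not taken from the benchmark's tooling or from any real model's results.

```python
from typing import Sequence


def pass_at_1(passed: Sequence[bool]) -> float:
    """Fraction of problems solved when the model gets one attempt each.

    `passed[i]` is True if the single generated solution for problem i
    passed all of that problem's unit tests.
    """
    if not passed:
        raise ValueError("need at least one problem")
    return sum(passed) / len(passed)


def meets_threshold(passed: Sequence[bool], threshold: float = 0.90) -> bool:
    """True if the pass@1 score is at or above the threshold (>= 90% here)."""
    return pass_at_1(passed) >= threshold


# Hypothetical outcomes for a 164-problem benchmark (HumanEval's size):
# 148 solved, 16 failed. These numbers are invented for the example.
example = [True] * 148 + [False] * 16
print(f"pass@1 = {pass_at_1(example):.4f}, resolves YES: {meets_threshold(example)}")
```

With 164 problems, a model would need at least 148 correct first attempts to reach 90%, since 147/164 falls just short.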

Valery Cherepanov bought Ṁ30 of YES