Will SOTA on any major code benchmark go up at least twice this year?
12
83
แน€190
resolved Jan 1
Resolved
YES

Major code benchmarks include:

  • HumanEval

  • APPS

  • Performance on any major code competition (IOI, ICPC, the various competition websites)

A single benchmark needs to go up twice. So a single model that improves SOTA on HumanEval and APPS would not resolve the market YES. We need two different models that both get SOTA on the same benchmark.

Get แน€200 play money

๐Ÿ… Top traders

#NameTotal profit
1แน€63
2แน€13
3แน€10
4แน€8
5แน€7