Short-term AI 3.4: By June 2024 will SOTA on APPS be >= 25%?
8
130Ṁ1297
resolved Jun 8
Resolved
NO

APPS is the more challenging code benchmark (compared to HumanEval). SOTA at market creation is 15.7 by CodeRL. I will use Competition Pass@any.

Notable that the current SOTA is using a very old LLM as the base model, and yet it still beats davinci-002.

Other short-term AI 3 markets:

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ171
2Ṁ31
3Ṁ21
4Ṁ11
5Ṁ5
© Manifold Markets, Inc.TermsPrivacy