By when will 85% be reached on the public evaluation set on ARC-AGI-Pub?
67
10kṀ94k
resolved Jan 7
Resolved
NO
01/01/2025
Resolved
YES
01/01/2026
Resolved
YES
01/01/2028
Resolved
YES
01/01/2030

See https://arcprize.org/arc-agi-pub.

This allows for arbitrary API usage unlike the ARC-AGI prize.

I will resolve based on when the date when the method is first reported even if it is not submitted until substantially later (due to e.g. no public API).

I will apply a restriction roughly in line with the $10,000 runtime cost contraint, though it is unclear how this should resolve with respect to private models with no known API price.

I will exclude solutions that seem heavily overfit based on the semi-private evaluation set.

(Cut off data is resolved based on midnight PT.)

Market context
Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ1,551
2Ṁ778
3Ṁ776
4Ṁ438
5Ṁ404
Sort by:

I think this market should resolved yes based on https://x.com/arcprize/status/1999182732845547795 and https://arcprize.org/leaderboard. Any objections?

@RyanGreenblatt I think this resolves NO? o3 did pass 85%, but it was well above the cost constraints.

sold Ṁ477 YES

wow, 83% on public with high efficiency. So close (and I feel the mana punishment for trading too fast before reading the fine print)

If 85% is reached on the main ARC-AGI prize track and there isn't a clear reason to think this wouldn't transfer to ARC-AGI-Pub, I would also resolve this market to true even if no one submits to ARC-AGI-Pub.

© Manifold Markets, Inc.TermsPrivacy