Will >50% of the tasks in the WebArena benchmark be solved by EOY 2024?
15
1kṀ2350
resolved Dec 18
Resolved
YES

In this tweet (https://twitter.com/ajeya_cotra/status/1684358475416064001?s=20), Ajeya Cotra (admirably) predicted that there's >50% chance >50% of the tasks in the newly announced WebArena benchmark will be solved by a single agent. Note that Ajeya didn't specify that a single agent had to solve all of them but I will resolve based on that, so there is the possibility of divergence.

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ481
2Ṁ25
3Ṁ9
4Ṁ8
5Ṁ7
© Manifold Markets, Inc.TermsPrivacy