Will this Dan Mac tweet hold up?
Resolved N/A on Sep 29

https://x.com/daniel_mac8/status/1972337371632013538

The claims that will be tested:
1. Anthropic releases Sonnet 4.5 this week (before October 6th).

  1. Tops SWE-Bench and Terminal-Bench

  2. 3-hour tasks on METR Long Duration

  3. Price parity with GPT-5

If there's ambiguity, I will message Daniel and try to get clarification, but will resolve it fairly per my judgement.


For the record, this market was created before I heard about the release.

bought Ṁ1,250 NO

Feel free to resolve N/A, if you like. Otherwise, it looks like it's more expensive than GPT-5.

@ian Yeah, if the timing wasn't so close, I would have let this stand, but without time to speculate it's no fun.
