Will Claude 3.5 Haiku be better than Claude 3 Opus?
Resolved NO on Nov 5

I will resolve it according to the most recent benchmark results from livebench.ai as soon as the results are available.

In the current benchmark, Claude 3 Opus scores 50.03 on the 'Global average', while Claude 3.5 Sonnet scores 59.80.
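For illustration only, the resolution rule might be sketched as below. The helper names are hypothetical, and the strict "greater than" comparison is an assumption: the description does not say how a tie would resolve.

```python
# Scores quoted in the description (LiveBench 'Global average').
OPUS_SCORE = 50.03
SONNET_35_SCORE = 59.80


def resolves_yes(haiku_score: float, opus_score: float = OPUS_SCORE) -> bool:
    """Hypothetical helper: YES only if 3.5 Haiku scores strictly higher.

    Treating a tie as NO is an assumption, not something the market states.
    """
    return haiku_score > opus_score


def gap_to_opus(score: float, opus_score: float = OPUS_SCORE) -> float:
    """Point difference relative to Claude 3 Opus, rounded to two decimals."""
    return round(score - opus_score, 2)


if __name__ == "__main__":
    # How far ahead of 3 Opus the quoted 3.5 Sonnet score is.
    print(gap_to_opus(SONNET_35_SCORE))
```

Under this reading, 3.5 Haiku would need a 'Global average' above 50.03 for the market to resolve YES.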

sold Ṁ55 YES

3.5 Haiku looks comparable to 4o-mini (which is 7 points behind 3 Opus on LiveBench) and somewhat below 1.5 Flash (which is 0.5 points behind 3 Opus on LiveBench).

bought Ṁ30 NO

Ya, I'd consider it well below gemini-1.5-flash-002.

The GPQA scores are barely above Claude 3 Sonnet.

bought Ṁ10 YES

@JoshYou now it's overcorrected imo, I'm at 30-40%
