Will Gemini achieve a score above 90% on the MATH benchmark?
Basic
20
แน€4.9k
Jan 1
92%
chance

The current SOTA is 84.3% from GPT-4 Code Interpreter. Code & tool use is allowed.

Get แน€1,000 play money
Sort by:

Since this market has no restrictions on public availability or zero shot, I think this should probably already resolve as yes per Gemini 1.5 report

This is a separate MATH than the one that Google reported the benchmark on. And I don't see it beating GPT4 by so much, given most of the other scores were very close.

limit order for yes 10