Will Gemini achieve a score above 90% on the MATH benchmark?
2025
46% chance

The current SOTA is 84.3%, from GPT-4 Code Interpreter. Code and tool use are allowed.


This is a different MATH benchmark from the one Google reported results on. And I don't see it beating GPT-4 by that much, given that most of the other scores were very close.

bought Ṁ0 of YES

limit order for yes 10