Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation
Mini
1
5
Dec 1
48%
chance

Context:

Resolution Criteria:

  • The market resolves as "Yes" if the model is evaluated by Scale AI and It receives a score strictly larger than 96.60 in the Math category.

  • The market resolves as "No" if the model is evaluated by Scale AI and it receives a score of 96.60 or less in the Math category

  • The market resolves as "N/A" if either

    1. Scale AI doesn't evaluate the model and add it to the leaderboard before October 1, 2024 or

    2. The evaluation methodology changes before the model is evaluated.

Get Ṁ1,000 play money