Will Gemini-1.5-Pro-Exp-0801 Score Above 90.35 (current #1) in Scale AI's Instruction Following Evaluation?
Oct 1 · 53% chance

Context:

Resolution Criteria:

  • The market resolves as "Yes" if the model is evaluated by Scale AI and it receives a score strictly greater than 90.35 in the Instruction Following category.

  • The market resolves as "No" if the model is evaluated by Scale AI and it receives a score of 90.35 or less in the Instruction Following category.

  • The market resolves as "N/A" if either:

    1. Scale AI doesn't evaluate the model and add it to the leaderboard before October 1, 2024, or

    2. The evaluation methodology changes before the model is evaluated.
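The criteria above can be read as a small decision procedure. The following is a minimal sketch, not an official resolution tool: the function name `resolve`, its parameters, and the reading of "before October 1, 2024" as a strict date comparison are illustrative assumptions. Only the 90.35 threshold, the deadline, and the three outcomes come from the criteria.

```python
from datetime import date
from typing import Optional

THRESHOLD = 90.35              # current #1 score, from the question title
DEADLINE = date(2024, 10, 1)   # model must be on the leaderboard before this date


def resolve(score: Optional[float],
            added_on: Optional[date],
            methodology_changed: bool) -> str:
    """Return "Yes", "No", or "N/A" under the stated criteria.

    All parameters are hypothetical inputs for illustration:
      score               -- Instruction Following score, or None if not evaluated
      added_on            -- date the model appeared on the leaderboard, or None
      methodology_changed -- True if the methodology changed before evaluation
    """
    # N/A case 1: not evaluated and added to the leaderboard before the deadline.
    if score is None or added_on is None or added_on >= DEADLINE:
        return "N/A"
    # N/A case 2: the evaluation methodology changed before the model was evaluated.
    if methodology_changed:
        return "N/A"
    # "Yes" requires a score strictly greater than the threshold; a tie resolves "No".
    return "Yes" if score > THRESHOLD else "No"
```

Under this reading, a score of exactly 90.35 resolves "No", since the "Yes" condition is a strict inequality.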
