Will Gemini-1.5-Pro-Exp-0801 Score Above 90.35 (current #1) in Scale AI's Instruction Following Evaluation

Context:

Gemini-1.5-Pro-Exp-0801 is currently the leading model on the LMYS Arena leaderboard (https://arena.lmsys.org/).
This market is about its potential evaluation by Scale AI (https://scale.com/leaderboard).

Resolution Criteria:

The market resolves as "Yes" if the model is evaluated by Scale AI and It receives a score strictly larger than 90.35 in the Instruction Following category.
The market resolves as "No" if the model is evaluated by Scale AI and it receives a score of 90.35 or less in the Instruction Following category
The market resolves as "N/A" if either
1. Scale AI doesn't evaluate the model and add it to the leaderboard before October 1, 2024 or
2. The evaluation methodology changes before the model is evaluated.

People are also trading