Claude 4 #1 on EQ-Bench Creative Writing v3?

Question

This market predicts whether either Claude 4 model (claude-sonnet-4, claude-opus-4) will attain the highest position on the EQ-Bench Creative Writing v3 leaderboard, the moment both of them are put on the site. EQ-Bench evaluates large language models on creative writing tasks, with rankings available at EQ-Bench Creative Writing v3 Leaderboard. The market resolves to 'Yes' if any Claude 4 model is listed as the top model on the leaderboard the time indicated, otherwise it resolves to 'No'.

If by July 1, 2025 either of the models aren't on the benchmark leaderboard, it resolves to 'No'.

Manifold Markets · Accepted Answer

No — resolved on May 30, 2025 by Manifold Markets prediction market.

#	Trader	Total profit
1		Ṁ9
2		Ṁ7

🏅 Top traders

People are also trading

Related questions