MP
Will a GPT-5-worthy model be able to break 50% on NYT Connections benchmark?
Resolved
YES
19
1k
Ṁ12k
resolved Oct 26