Will a text model achieve 100% performance on the MMLU in five years?
28% chance

This market asks whether a model will achieve 100% performance on the Massive Multitask Language Understanding (MMLU) benchmark.

This will resolve YES if a model is listed as having achieved 100% performance on the MMLU on Papers with Code, in a technical document, or in a published article by 3:00 PM EST on June 28, 2028. This will resolve NO if no model is listed as having achieved 100% performance on the MMLU by that deadline.
