Will any model pass an "undergrad proofs exam" Turing test by 2027?
16
22
Ṁ647Ṁ640
2027
74%
chance
1D
1W
1M
ALL
The model receives each question as text (or text + images), outputs an answer as text + images, and is graded as part of a pool with human students who also took the test.
"Pass" means >=70%
Has to be a proofs-based exam, e.g. abstract algebra, topology, linear algebra if it's proofs heavy.
There are probably undergrad math exams *somewhere* that are very easy, so I will be exercising my judgment on whether the exam "counts". Unfortunately I do not have examples to hand of what I consider reasonable, but something like "would be a medium difficulty 200-level proofs exam at a top-tier university".
Get Ṁ200 play money
Sort by:
Ultra-likely.
Topology was almost trivial, especially comparable to IMO questions. I recall “prove at least 3 of 12 theorems” being the final exam, pretty sure an AI could blow past that even today.
The handwriting and telling the AI not to be too giga-brained (make a few mistakes and don’t solve them all) would be harder than the solving.
More related questions
Related questions
Will AI pass the Longbets version of the Turing test by the end of 2029?
52% chance
Will a Large Language Model prove an important math theorem by end of 2024?
9% chance
Will language models be able to solve simple graphical mazes by the end of 2025?
65% chance
Will the majority of mathematicians rely on formal computer proof assistants before the end of 2040?
65% chance
Will we have a formalized proof of the Modularity theorem by 2029-05-01?
60% chance
Will a smart agent pass our Turing test by the end of 2025?
65% chance
Will an an LLM be able to pass something equivalent to Yann LeCun's 7-gear test by the end of 2024?
71% chance
Will we have a formalized proof of Fermat's last theorem by 2029-05-01?
59% chance
In 2029, will any AI be able to take an arbitrary proof in the mathematical literature and translate it into a form suitable for symbolic verification? (Gary Marcus benchmark #5)
39% chance
Will an AI get a perfect SAT score before 2025?
81% chance