Will any model pass an "undergrad proofs exam" Turing test by 2027?
Plus
20
Ṁ8712027
77%
chance
1D
1W
1M
ALL
The model receives each question as text (or text + images), outputs an answer as text + images, and is graded as part of a pool with human students who also took the test.
"Pass" means >=70%
Has to be a proofs-based exam, e.g. abstract algebra, topology, linear algebra if it's proofs heavy.
There are probably undergrad math exams *somewhere* that are very easy, so I will be exercising my judgment on whether the exam "counts". Unfortunately I do not have examples to hand of what I consider reasonable, but something like "would be a medium difficulty 200-level proofs exam at a top-tier university".
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
Ultra-likely.
Topology was almost trivial, especially comparable to IMO questions. I recall “prove at least 3 of 12 theorems” being the final exam, pretty sure an AI could blow past that even today.
The handwriting and telling the AI not to be too giga-brained (make a few mistakes and don’t solve them all) would be harder than the solving.
Related questions
Related questions
Will AI pass the Longbets version of the Turing test by the end of 2029?
58% chance
Before 2030, will an AI complete the Turing Test in the Kurzweil/Kapor Longbet?
51% chance
Will AI pass Video Turing Test by 2030?
69% chance
Will a smart agent pass our Turing test by the end of 2025?
59% chance
Will the Twitter Turing Test be passed by 2025?
68% chance
Will any AI be able to explain formal language proofs to >=50% of IMO problems by the start of 2025?
60% chance
Will models be able to do the work of an AI researcher/engineer before 2027?
40% chance
In what year will there be an AI capable of passing a high-quality Turing test?
Will AI pass the Bob Ross Turing Test by 2035?
70% chance
In 2029, will any AI be able to take an arbitrary proof in the mathematical literature and translate it into a form suitable for symbolic verification? (Gary Marcus benchmark #5)
65% chance