
Using the same resolution criteria as in the Metaculus question about this.
🏅 Top traders
# | Name | Total profit |
---|---|---|
1 | Ṁ56 | |
2 | Ṁ53 | |
3 | Ṁ45 | |
4 | Ṁ31 | |
5 | Ṁ22 |
People are also trading
I gave ChatGPT literally the easiest problem on the Putnam, and it could not be solved.
I know it's not the same, since ChatGPT is an LLM, while there are better more specialized AI out there. But the problem I gave GPT was so easy compared to other Putnam problems. It wasn't a proof-based one, but it had a definitive answer.
@MatthewBarnett I'd be fine with that, though it gets a little iffy if it solves an old exam that's had answers up online, since they could have gotten discussed in other forums that were in its training data.
From Metaculus:
“This question resolves on the date during which a computer program first clearly demonstrates the ability to receive a perfect score on the William Lowell Putnam Mathematical Competition, without cheating, and within the time limits given in the real-world competition. Cheating includes training on content that could conceivably spoil the solutions to the competition, and includes having access to external equipment normally forbidden during the competition that can be used to aid solving the problems, or advice from other mathematicians. Thus, Metaculus administrators should be careful not to resolve this question prematurely.
In the strictest case, the model should be tested on the most recent Putnam Competition, after having trained the model prior to the release of the most recent solutions. Here is an archive of Putnam Competition problems going back to 1985. Since it is generally understood that Putnam problems have become harder over time, this question will not consider any candidate program that receives a perfect score on a Putnam examination from prior to 2000 as eligible to trigger positive resolution.”
@JimHays Right, I'm saying it would be difficult to rule out that old competitions were in the training data for modern LLMs.
Metaculus thinks this won't happen until 2035.
https://www.metaculus.com/questions/11671/when-will-ai-master-putnam-math/