Will GPT-4 be able to consistently solve college-level discrete math problems?

Resolved NO on May 16

I will test this using CS 70 at UC Berkeley (https://www.eecs70.org/) once I get access to GPT-4. I will take every text-only homework problem from the most recent iteration of the class and copy and paste each one in, with at most a minimal amount of prompt engineering (e.g. "pretend you are a brilliant mathematician"). Even if GPT-4 has image recognition abilities, I won't use problems with images.

After this, I will grade GPT-4's responses. I am currently a TA for the class and was a grader for it for two semesters, so the grading will be as close to real course grading as it can be. If GPT-4 scores above 73%, I will resolve this market YES. (A student in CS 70 who scores above 73% on a homework receives 100% on it.) Otherwise, I will resolve it NO.
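The resolution rule above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and pooling all graded problems into a single points total is an assumption, since the description does not say whether the 73% applies per homework or overall.

```python
# Threshold taken from the market description: strictly above 73%.
PASS_THRESHOLD = 0.73


def credited_score(raw_fraction: float) -> float:
    """CS 70 homework rule as described: a raw score above 73% counts as 100%."""
    return 1.0 if raw_fraction > PASS_THRESHOLD else raw_fraction


def resolve_market(points_earned: float, points_possible: float) -> str:
    """Return "YES" if the overall score is strictly above 73%, else "NO".

    Assumption: all graded problems are pooled into one points total.
    """
    if points_possible <= 0:
        raise ValueError("points_possible must be positive")
    return "YES" if points_earned / points_possible > PASS_THRESHOLD else "NO"
```

Note that the comparison is strict, so a score of exactly 73% would resolve NO under this reading.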

If GPT-4 is released under a different name, I'll test that model.

Note: ChatGPT and Bing Chat both fail at this. They produce good-looking answers but consistently make incorrect statements such as "9 is prime".

GPT-4 speculation

People are also trading

Will GPT-5 be able to solve A::B system puzzles consistently

Will GPT-5 not be terrible at the "Numbers Game"?

Will GPT-5 be capable of some form of online learning?

Will OpenAI's next major LLM (after GPT-4) solve more than 2 of the first 5 new Project Euler problems?

GPT-4 #5: Will GPT-4 be a dense model?

Will GPT-5 be able to get gold on the International Mathematical Olympiad?

Will GPT-4 escape?

Will GPT-5 get the Monty *Fall* problem correct?

Will GPT-5 be capable of achieving superhuman performance in at least one exam that is typically taken by humans?
