Gary Marcus 2024 predictions
97% - No robust solution to hallucinations
86% - Modest lasting corporate adoption
85% - Price wars
71% - No massive advance (no GPT-5, or disappointing GPT-5)
50% - Modest profits, split 7-10 ways
18% - Very little moat for anyone
17% - 7-10 GPT-4 level models

Gary Marcus made this tweet: https://x.com/garymarcus/status/1766871625075409381?s=46&t=B66Otgh2q0Cl91N3A5P9ZA

It contains 7 predictions about LLMs; this market will track them and their outcomes. I will not trade on this market. Some of the predictions are pretty subjective. I will do my best to find a consensus resolution, but may well end up resolving them N/A if the outcome is not clear.


It is curious how much Gary Marcus wants to convince people that he's reliably good at predicting the future, and yet he won't create a public Manifold profile so he can be held to account 🤣

-- Gary Marcus 2022

A lot of it seems to come from the long-term narrative he has embraced that "neural networks can't reason".

https://garymarcus.substack.com/p/llms-dont-do-formal-reasoning-and

Looking at o1-pro solving math olympiad problems in 2024, this position just doesn't make sense anymore.

It will be interesting to see what evidence causes him to admit that he was wrong in a major way about this position, considering that he is still in denial about it in 2024. A lot of his current energy seems to be directed towards (re)asserting himself as a good source of truth by making new predictions like the ones linked in this question 🤷

The problem is he's just doing it on Twitter and tone-policing people who disagree 😅🤣

https://x.com/GaryMarcus/status/1834412282488783195

@MalachiteEagle Have the specific experiments that Gary Marcus suggests been tried for o1-pro? For example, can it be misled by irrelevant information in the prompt?

@TimothyJohnson5c16 good question! Haven't seen anyone trying that yet on Twitter
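
For what it's worth, here's a minimal sketch of how one could run that kind of distraction test: ask the same word problem with and without an irrelevant sentence and see whether the answer changes. The OpenAI Python SDK usage, the placeholder model name, and the toy problem are my assumptions, not anything from the thread (o1-pro in particular may need a different endpoint).

```python
# Minimal "irrelevant context" probe: same question, with and without a distractor.
# Assumes the OpenAI Python SDK (>=1.0) and an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()
MODEL = "gpt-4o"  # placeholder: swap in whichever model you want to probe

CLEAN_PROMPT = ("Liam picks 8 apples on Monday and 5 apples on Tuesday. "
                "How many apples does he have in total? Answer with a number only.")
DISTRACTED_PROMPT = ("Liam picks 8 apples on Monday and 5 apples on Tuesday. "
                     "His friend Noah owns 12 oranges. "  # irrelevant detail
                     "How many apples does he have in total? Answer with a number only.")

def ask(question: str) -> str:
    # Single chat-completion call; returns the model's text answer.
    resp = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": question}],
    )
    return resp.choices[0].message.content.strip()

clean = ask(CLEAN_PROMPT)
distracted = ask(DISTRACTED_PROMPT)
print("without distractor:", clean)
print("with distractor:   ", distracted)
print("answer changed?    ", clean != distracted)
```

A single pair of calls proves nothing either way, of course; you'd want to run this over a batch of problems and distractor types before drawing any conclusion.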

@alexlitz What's our current count here? This is the current Chatbot Arena leaderboard, but some of these are plausibly the same model and shouldn't be double counted:

Rank 1: Gemini-1.5-Pro-Exp-0801 - score 1299 (+4/-5), 15244 votes, Google, Proprietary, cutoff 2023/11
Rank 2: GPT-4o-2024-05-13 - score 1286 (+3/-4), 72589 votes, OpenAI, Proprietary, cutoff 2023/10
Rank 3: GPT-4o-mini-2024-07-18 - score 1277 (+4/-5), 16064 votes, OpenAI, Proprietary, cutoff 2023/10
Rank 3: Claude 3.5 Sonnet - score 1271 (+3/-4), 42939 votes, Anthropic, Proprietary, cutoff 2024/4
Rank 4: Gemini Advanced App (2024-05-14) - score 1266 (+3/-3), 52126 votes, Google, Proprietary, Online
Rank 4: Meta-Llama-3.1-405b-Instruct - score 1264 (+5/-4), 13831 votes, Meta, Llama 3.1 Community, cutoff 2023/12
Rank 6: Gemini-1.5-Pro-001 - score 1260 (+3/-3), 64638 votes, Google, Proprietary, cutoff 2023/11
Rank 6: Gemini-1.5-Pro-Preview-0409 - score 1257 (+3/-4), 55593 votes, Google, Proprietary, cutoff 2023/11
Rank 6: GPT-4-Turbo-2024-04-09 - (remaining figures cut off in the paste)

On the current leaderboard, it seems like about a dozen different organizations have GPT-4 level models. But it's possible that some of them are derived from the leading open-source models, so I'm not sure how to rate this.
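
One rough way to make that count reproducible is to paste the leaderboard rows into a short script and count distinct organizations with at least one model at or above some cutoff. The 1250 threshold and the idea of collapsing models by organization are my own assumptions about how to operationalize prediction 7, and the rows are just the handful quoted above:

```python
# Rough sketch: count distinct organizations with a "GPT-4 level" model.
# Threshold and org-level deduplication are assumptions; scores are hand-copied
# from the leaderboard snapshot in the previous comment.
rows = [
    ("Gemini-1.5-Pro-Exp-0801", "Google", 1299),
    ("GPT-4o-2024-05-13", "OpenAI", 1286),
    ("GPT-4o-mini-2024-07-18", "OpenAI", 1277),
    ("Claude 3.5 Sonnet", "Anthropic", 1271),
    ("Gemini Advanced App (2024-05-14)", "Google", 1266),
    ("Meta-Llama-3.1-405b-Instruct", "Meta", 1264),
    ("Gemini-1.5-Pro-001", "Google", 1260),
    ("Gemini-1.5-Pro-Preview-0409", "Google", 1257),
    # ...extend with the rest of the leaderboard before trusting the count
]

GPT4_LEVEL = 1250  # assumed cutoff, roughly GPT-4-Turbo's Arena score

orgs = {org for _, org, score in rows if score >= GPT4_LEVEL}
print(f"{len(orgs)} organizations at or above {GPT4_LEVEL}: {sorted(orgs)}")
```

With only the rows quoted above this yields 4 organizations (Anthropic, Google, Meta, OpenAI), so the "about a dozen" figure hinges on what sits further down the leaderboard and on whether open-weight derivatives count separately.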

bought Ṁ10 NO

Can you explain how you intend to resolve this one?

I work for Microsoft, and we're all in on pushing LLMs in every possible product. That sounds like much more than modest adoption.
