
Roon (https://twitter.com/tszzl/status/1736286837822595177): Great minds discuss flops; average minds discuss data; small minds discuss architecture.
Eliezer Yudkowsky: This will not age well.
Roon: why
If non-obvious (meaning not trading very high or very low), will be resolved via Twitter poll asking if this aged well (with no additional wording), or other similar survey mechanism.
Will resolve to YES or NO, not to a percentage. No aging mid.
I would argue this does not hold today already.
Top models on LMSYS are not the largest. Claude 3 Sonnet, the second-smallest variant in its family, is currently #2. SOTA models are generally trending down in parameter count.
https://www.microsoft.com/en-us/research/publication/textbooks-are-all-you-need/ - better data leads to drastically improved performance even at small scale.
The DALL-E 3 paper is literally titled "Improving image generation with better captions": https://cdn.openai.com/papers/dall-e-3.pdf Not "Improving image generation via scaling" or anything like that.
Even more damning example: Pony Diffusion v6 has dominated the open image generation scene for half a year now. It has more downloads on Civitai than all other SDXL-based models combined, thanks to its advances in prompt understanding, yet it was trained on just 3 A100 GPUs using the stock SDXL architecture. Interview with the creator: https://www.youtube.com/watch?v=MQz58wPvT3I
I think aging mid is the most likely option. Whatever AI exists in 2027 will almost certainly use more FLOPs, but it will also almost certainly use meaningfully different architectures, IMO. Whether it'll use more data is unclear to me. I think it's likely to be difficult to disentangle the benefits of more FLOPs from those of different architectures, somewhat similarly to how neural networks have become more popular and better developed over the past decade or so as computation has gotten cheaper.
@VAPOR My interpretation is that he thinks getting more FLOPs (i.e., more computing power) matters more to the progression of AI than training data or architecture. Yud disagrees, presumably about architecture in particular; he's talked before about how neural networks in general seem hard to reliably align. Meaning Yud likely thinks innovation in architecture is important to safety and/or capabilities.