Will AI be able to write, compile, and unit test a single .c file to reproduce GPT-2 training from PyTorch code by 2025?

Ṁ1kṀ1.2k

resolved Apr 16

Resolved

ALL

Start date: April 9 , 2024

End date: April 9, 2025

Market with a longer timeline:

inspired from this tweet by Andrej Karpathy:

Btw writing the llm.c training code would imo be a very interesting, impressive, self-contained and very meta challenge for LLM agents.
The prompt is: Take the PyTorch code train_gpt2.py And write, compile and unit test a single .c file that reproduces the training: train_gpt2.c
The current models are not there, but we can check back in a year or two or so. If that worked...

Market context

Technology

Technical AI Timelines

AI Impacts

Programming

Get

1,000

to start trading!

🏅 Top traders

#	Trader	Total profit
1		Ṁ172
2		Ṁ34
3		Ṁ32
4		Ṁ21
5		Ṁ11

People are also trading

How fast will you be able to train a GPT-2-level AI on a consumer GPU in 2030?

609

Will AI be able to write, compile, and unit test a single .c file to reproduce GPT-2 training from PyTorch code by 2026?

25% chance

Will $10,000 worth of AI hardware be able to train a GPT-3 equivalent model in under 1 hour, by EOY 2027?

16% chance

Will AI pass the Rube Goldberg Turing test by the end of 2028?

38% chance

Before 2028, will anyone train a GPT-4-level model in a minute?

29% chance

Will an AI system similar to Auto-GPT make a successful attempt to kill a human by 2030?

27% chance

Will we fully interpret a GPT-2 level language model by 2028?

14% chance

GPT-Zero: By 2030, will anyone develop an AI with a massive GPT-like knowledge base that it taught itself?

33% chance

By 2030, can we convert at least 10% of an AI's weights to C code, enhancing interpretability?

40% chance

Will it be possible to disentangle most of the features learned by a model comparable to GPT-2 this decade?

Sort by:

@mods Please Resolve (creators Account ist deleted)

@winged_one N/A? Or what. I don't see anything in here about how to resolve it.

@Eliza Probably seems Kind of hard to verify one way or another so Probably N/A unless you can verify wheter or Not AI can do whats asked in the question.

@winged_one IMHO, defaulting to resolving NA is not a great practice for markets like this.

If YES bettors have not presented an example, and a quick google doesn't show anyone talking about it, then I think it should default to NO. Having to show that something hasn't happened or isn't possible (across all the various LLMs) in order to resolve NO is quite a burden.

We could try it ourselves, (gpt2.py is here) but most of us lack the expertise to judge the result, at least without spending an unreasonable amount of time.

I think resolve NO and clarify in the 2026 market that resolution will be NO if evidence isn't presented by YES bettors or available from a quick google search.

https://x.com/max_paperclips/status/1891782438311141799

For clarification, AI should only be able to write the file given the prompt in the tweet?

@Jacy interested?

@firstuserhere thanks for thinking of me! I'm usually willing to bet a lot on priors, but there are just too many idiosyncrasies here (how much can it just copy existing code, how many times will this be attempted, how good are Devin/etc. at this particular sort of coding, etc.) for me to take a significant position, especially against anyone who has actually looked into this.