In the spirit of what Gary Marcus says here:
https://twitter.com/GaryMarcus/status/1640029885040132096?s=20
Two weeks left on this. I would argue these two are relatively strong evidence of this? What do you think?
https://www.technologyreview.com/2023/12/14/1085318/google-deepmind-large-language-model-solve-unsolvable-math-problem-cap-set/
@Mag I don't think that counts as an LLM inferring scientific principles. It just means that LLMs can solve (with code) problems that were previously unsolved. It's not so different from getting AI to solve a previously unseen Sudoku. Inferring scientific principles is more than that.
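For anyone unfamiliar with the problem in that article: a cap set in F_3^n is a set of vectors with no three distinct members on a common line, which in F_3^n is the same as no three distinct members summing to the zero vector. Here's a minimal Python sketch of a verifier for that property, just to illustrate what the problem asks; this is not the FunSearch code, which (per the article) evolved heuristics for constructing large cap sets:

```python
from itertools import combinations

def is_cap_set(vectors, n):
    """Return True if no three distinct vectors in F_3^n are collinear.

    In F_3^n, three distinct points lie on a common line exactly when
    they sum to the zero vector mod 3, so a cap set is a set of vectors
    containing no such triple.
    """
    for a, b, c in combinations(vectors, 3):
        if all((a[i] + b[i] + c[i]) % 3 == 0 for i in range(n)):
            return False
    return True

# These four points form a maximum-size cap set in F_3^2.
print(is_cap_set([(0, 0), (0, 1), (1, 0), (1, 1)], 2))  # True
```

As I understand it, FunSearch's advance was having an LLM evolve programs that construct larger cap sets than previously known, with an automated evaluator along these lines scoring candidates, which is why I'd call it program search rather than inferring a principle.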
Does this count? I don't think so personally
https://www.deepmind.com/blog/alphadev-discovers-faster-sorting-algorithms
@Mag It's not a scientific principle (a fundamental truth, law, or assumption about the natural world).
They can solve a novel reverse-engineering problem (pg. 119), build model graphs of an environment they explore (pg. 51), and match human performance on a sample of LeetCode problems posted after GPT-4's pretraining period ended (pg. 21):
Sparks of Artificial General Intelligence: Early experiments with GPT-4 (arXiv:2303.12712): https://arxiv.org/abs/2303.12712
If none of the examples in that paper convince you that they can already form models of things, infer facts from those models, and solve novel (if relatively easy) problems, I'm not sure what would.
@Mira I'm with you in spirit, but I think what Gary Marcus is looking for is something that very clearly moves beyond the training data. I believe his reasoning for why those things aren't evidence is that the solutions could be the result of GPT-4 having effectively learned a hard-coded algorithm for that type of problem, one that activates whenever it recognizes the pattern. I don't believe this myself, but to disprove it we would need a problem that is truly novel in both content and structure and that was not seen in the training data.
A new scientific discovery should definitely count, imo.