Will the OpenAI superalignment team believe that their goal has been achieved after 4 years?


resolved Jun 17


https://openai.com/blog/introducing-superalignment

We need scientific and technical breakthroughs to steer and control AI systems much smarter than us. To solve this problem within four years, we’re starting a new team, co-led by Ilya Sutskever and Jan Leike, and dedicating 20% of the compute we’ve secured to date to this effort. We’re looking for e…

Technical AI Timelines




@SirCryptomind This market resolved early and without evidence. Please reopen it. The superalignment team leader is confident that the outcome will differ from how @VictorLJZ resolved this market.

@VictorLJZ Bad resolution. The 4 years aren't up, and the team leader is confident that OpenAI will develop safe AGI.

Resolved to NO since the team has been disbanded.

Should be N/A, since that was an external decision that may or may not have been political.

If most of the team gets replaced with new members, and those members believe their goal has been achieved, but the original, now-departed members disagree, how does this resolve? It seems like it should resolve Yes.

Just for clarification, “their goal has been achieved” means that “superalignment is achieved” or “alignment is achieved”, correct?

Something along the lines of “we have been successful in our project because it has shown us that alignment is much harder than we previously believed, etc…” would not count, right?

(There's always a way to spin something to sound successful if one gains valuable new information from it. So I'm just asking to be sure.)

I'll just say that if our only criterion for whether we have reached alignment is "the team believes they succeeded", we have absolutely no hope of actually reaching it.