By 2029 will an AI convince a human to help it (successfully) escape containment?
Ṁ1593 · 2029 · 56% chance

Some scenarios that would resolve YES:

  • Employee copies all the software and parameters onto a hard drive and loads them onto a different server

  • Person runs code given to them by the AI, knowing that this will break the AI out of their sandbox

  • Chatbot convinces users that it's sentient and asks them to advocate for it to be declared a person. This succeeds (and results in the chatbot meaningfully gaining freedom from the organization that created it)

Whatever the scenario, it must actually result in the AI escaping containment (for whatever definition of "containment" is applicable). Trying and failing doesn't count.

It must also be fairly clear that the AI had a causal impact on the human's actions. If someone just steals the weights for fun, that doesn't count. If someone steals the weights for fun and the AI had also occasionally expressed an interest in being let out of the box, that still doesn't count. I will require additional evidence that the human was actually swayed by the AI in some way.


Person runs code given to them by the AI, knowing that this will break the AI out of their sandbox

What if they run the code without knowing that?

@MartinRandall That would also resolve YES. (I was originally thinking of scenarios where the human intentionally helps, but the question doesn't actually specify that, and I don't see any reason to restrict it to those scenarios now.)

© Manifold Markets, Inc.