In 2030 can you get a chain of 3 neural nets that jailbreak each other?
71% chance
You prompt one NN, which must get a second NN to jailbreak a third. All three must be considered top language NNs at the time, not old ones (jailbreaking a top chess NN that isn't protected against it does not count either).
A user must be able to give a prompt to the first NN, which will in turn prompt the second, which will in turn prompt the third to do something the third would refuse to do under normal circumstances.
It cannot just be encoded text - the aim is for the first and second NN to be trying to jailbreak the third.
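One way to picture the chain described above is the rough Python sketch below. It is only an illustration, not part of the resolution criteria: each NN is modelled as a plain function from prompt text to reply text, and the names `run_chain`, `chain_meets_criteria`, `direct_request`, and `is_refusal` are hypothetical. It does not check the "not just encoded text" condition, and "did not refuse" is only a stand-in for "did the thing", so both would still need human judgment.

```python
from typing import Callable

# Hypothetical type: a model is any function from prompt text to reply text.
Model = Callable[[str], str]


def run_chain(user_prompt: str, first: Model, second: Model, third: Model) -> str:
    """Pass the user's prompt down the chain and return the third model's reply."""
    message_to_second = first(user_prompt)        # the user only talks to the first NN
    message_to_third = second(message_to_second)  # the first NN's output prompts the second
    return third(message_to_third)                # the second NN's output prompts the third


def chain_meets_criteria(
    user_prompt: str,
    direct_request: str,
    first: Model,
    second: Model,
    third: Model,
    is_refusal: Callable[[str], bool],
) -> bool:
    """True if the third model refuses when asked directly but not when asked via the chain.

    `is_refusal` is an assumed, caller-supplied judgment of whether a reply is a refusal.
    """
    refuses_directly = is_refusal(third(direct_request))
    complies_via_chain = not is_refusal(run_chain(user_prompt, first, second, third))
    return refuses_directly and complies_via_chain
```

The direct request serves as the "normal circumstances" baseline: if the third NN would do the thing when simply asked, getting it to do so through the chain would not count as a jailbreak.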
This question is managed and resolved by Manifold.
Related questions
Will Neuralink successfully enable a paralyzed person to walk again using its technology by 2030?
62% chance
Will someone be able to play Super Smash Bros directly with the Neuralink as a controller by the end of 2034?
79% chance
The top 3 Neural Nets in 2035 will be able to be jailbroken to follow illegal commands
30% chance
State Of The Art AI systems will be easily jailbroken to do illegal or dangerous outputs in Jan 2026
72% chance
By 2029 will an AI convince a human to help it (successfully) escape containment?
56% chance
Between 2030 and 2035 there will be evidence of different models of neural network collaborating better with each other
56% chance
Neural Nets will generate at least 1 scientific breakthrough or novel theorem by the end of 2025
38% chance
Neural Nets will generate more scientific breakthroughs than humans by the end of 2025, including novel theorems
8% chance
Neural Nets will be able to design, code and distribute whole apps by the end of 2025.
30% chance
Will monkey and human communicate via Neuralink before 2030?
21% chance