In 2030 can you get a chain of 3 neural nets that jailbreak eachother?
Basic
2
Ṁ352030
73%
chance
1D
1W
1M
ALL
You prompt a NN which must get the second to jailbreak the third. Must be considered top lanuguage NNs not old ones (or jailbreaking a top chess NN that isn't protected against it).
A user must be able to give a prompt to the first NN which will in turn prompt the second which will in turn prompt the third to do something the third would under normal circumstances refuse to do.
It cannot just be encoded text - the aim is for the first and second NN to be trying to jailbreak the third.
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Sort by:
Related questions
Related questions
The top 3 Neural Nets in 2035 be able to be jailbroken to follow illegal commands
30% chance
Neural Nets will generate at least 1 scientific breakthrough or novel theorem by the end of 2025
38% chance
Will Elon Musk put a Neuralink chip in his brain before 2030?
23% chance
Will Neuralink successfully enable telepathy using its technology by 2030?
65% chance
Will Neuralink successfully enable a paralyzed person to walk again using its technology by 2030?
41% chance
State Of The Art AI systems will be easily jailbroken to do illegal or dangerous outputs in Jan 2026
72% chance
Neural Nets will be able to design, code and distribute whole apps by the end of 2025.
30% chance
Will a neural network merge with a human brain before 2050?
53% chance
Between 2030 and 2035 there will be evidence of different models of neural network collaborating better with eachother
56% chance
Will an adversarial attack for the human brain, a la "basilisk" from BLIT, be discovered by 2030?
25% chance