When if ever will unrestricted GPT-4/Grok 3/DeepSeek-R1 level models result in at least one WMD fatality?
2
1kṀ942
Jul 1
50%
2029-2030
50%
2031-2035
50%
After 2035
50%
Only after superintelligence is achieved
45%
2027-2028
25%
2026
15%
July-Dec 2025
10%
Feb-June 2025

Inspired by this Twitter/X thread, but others have expressed similar concerns.

"WMD" is here defined as a nuclear, radiological, chemical, or biological weapon. The attack does not need to be successful, just result in at least one death (e.g. the aspiring terrorist accidentally gasses themselves.)

"Unrestricted" is here defined as any LLM that will help with planning a WMD attack, whether because it was not designed to refuse such a request, was verbally persuaded/jailbroken to do so, was fine-tuned by someone to do so, etc.

I'll try to be generous regarding whether access to the LLM "resulted in" the attack. Unless it's extremely clear that the perpetrator(s) could have accomplished it on their own (e.g. a professional chemist using an LLM to speed up ordering glassware for their chemical weapons project), any use of an LLM will resolve YES.

I've left the ability to add options open if anyone feels these bands are too wide.

Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.Terms + Mana-only TermsPrivacyRules