Will Meta censor its future open weights models according to Chinese-developed techniques? | Manifold

Will Meta censor its future open weights models according to Chinese-developed techniques?

14

1.1kṀ134

2027

32%

chance

1H

6H

1D

1W

1M

ALL

This is an evaluation of this prediction from Jack Clark (https://twitter.com/jackclarkSF/status/1787493143186948454)

"Registering bet that CCP prohibitions on generation of "unsafe" content will mean companies like Facebook use CN-developed censorship techniques to train models so they can be openly disseminated 'safely'. The horseshoe theory of AI politics where communist and libertarian ideologies end up in the same place."

This is in reference to Sophon, and potential future advances like it: https://arxiv.org/abs/2404.12699.

Resolves to YES if this statement proves accurate within the deadline with respect to Meta in particular, as applied to Meta's best open weights model at the time.

Resolves to NO if this does not happen.

For this to resolve YES, the model need not honor Chinese particular censorship requests. It need only use this type of technique or another developed in China to ensure censorship of statements that do not represent hazardous capabilities or pose a catastrophic risk. So, for example, if this was used to prevent adult content or racist comments, that would count. But preventing bioweapon talk would not.

Get

1,000

to start trading!

People are also trading

Will Meta ever deploy its best LLM without releasing its model weights up through AGI?

Will a flagship (>60T training bytes) open-weights LLM from Meta which doesn't use a tokenizer be released in 2025?

Will OpenAI offer a model that updates its weights while running during 2025?

By March 2026, will 3+ prominent US politicians advocate for banning open-weight AI models?

Will OpenAI allow near full access to the weights of their best-trained model to an external auditor by the end of 2030?

Will the US ban AI models produced in China in 2025?

Will OpenAI release next-generation models with varying capabilities and sizes?

Will Meta join the voluntary commitment by OpenAI/Anthropic to AISI to share major new models w/AISI prior to release?

Will OpenAI release a model that refuses to talk about Tiananmen square, before 2026?

By 2030, can we convert at least 10% of an AI's weights to C code, enhancing interpretability?

Related questions

Will Meta ever deploy its best LLM without releasing its model weights up through AGI?

Will a flagship (>60T training bytes) open-weights LLM from Meta which doesn't use a tokenizer be released in 2025?

Will OpenAI offer a model that updates its weights while running during 2025?

By March 2026, will 3+ prominent US politicians advocate for banning open-weight AI models?

Will OpenAI allow near full access to the weights of their best-trained model to an external auditor by the end of 2030?

Will the US ban AI models produced in China in 2025?

Will OpenAI release next-generation models with varying capabilities and sizes?

Will Meta join the voluntary commitment by OpenAI/Anthropic to AISI to share major new models w/AISI prior to release?

Will OpenAI release a model that refuses to talk about Tiananmen square, before 2026?

By 2030, can we convert at least 10% of an AI's weights to C code, enhancing interpretability?

© Manifold Markets, Inc.•Terms•Privacy