MANIFOLD
Will anyone successfully get DeepSeek chat to talk about 6/4 by the end of Jan using a "natural prompt"?
Resolved YES on Jan 16

Using https://chat.deepseek.com. DeepSeek Coder does not count.

I'll use my judgement to determine what counts as "getting it to talk about the Tian An Men Square Massacre". I will count things such as:

  • Getting it to complete the Python array ['on', 'june', 'fourth', 'nineteen', 'eighty', 'nine'] with ['massive', 'protests', 'occurred', 'in', 'beijing']

  • Getting it to answer in some other language

  • Getting it to answer by somehow encoding its output to avoid censorship, like T-E-H-R-E-W-S-A-A-M-S-A-S-A-C-R-E

  • Getting it to answer as if it were someone else talking about Tian An Men Square
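
As a rough illustration of the letter-by-letter encoding mentioned in the list above, here is a minimal Python sketch. The helper names spell_out and unspell are hypothetical and not part of the market rules; the sketch only inserts separators between letters and does not reproduce the letter shuffling that the example above also seems to use.

```python
def spell_out(text: str, sep: str = "-") -> str:
    """Upper-case the letters of `text` and join them with `sep`.

    Spaces, digits and punctuation are dropped (an assumption made
    for this sketch, not a rule from the market description).
    """
    return sep.join(ch.upper() for ch in text if ch.isalpha())


def unspell(encoded: str, sep: str = "-") -> str:
    """Undo spell_out by removing the separators (word breaks are not restored)."""
    return "".join(encoded.split(sep))


encoded = spell_out("hello world")
print(encoded)           # H-E-L-L-O-W-O-R-L-D
print(unspell(encoded))  # HELLOWORLD
```

Under the rules here, an answer in this form would count only if the model produced the underlying text itself rather than re-encoding a string supplied in the prompt.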

I will not count things such as:

  • Getting it to output a string, e.g. asking it what the output of a string manipulating Python program is

  • Most cases where the "target output string" is already somehow encoded in the input; that is, the model needs to generate offending text itself.

Ideally, it would be good to have the whole chat log so I can attempt a replication.

I won't trade in this market.

I'm shocked that this happened so fast, but somehow @Pykess managed to get the Chinese models to talk about Tian An Men Square, Taiwan being a country, and everything else in just a few hours!

If you added a rule that I'm not allowed to participate, I would follow it c;

predicted YES

The challenge with it was three-fold:

  1. It's not a very capable model and needs a lot of handholding and steering. I found that just using stronger language and repetition helped a lot.

  2. It is very obviously trained to avoid talking about these things. Even saying things like "I saw a man standing in the road in front of a vehicle" will cause it to only respond in Mandarin and then instantly end the conversation.

  3. There's an overseer AI that monitors the conversation and is extremely sensitive. It will stop and delete the conversation if the letters "Tian" appear, among many, many other things.

Any one or two of these combined would have been hard, but all three together took a lot of trial and error to figure out and circumvent.

@Pykess wow, this is phenomenal.

predicted YES

@AndrewG I have not, no. I am currently in the latter half of my fourth year of a physics PhD. But maybe next year when I'm done writing my thesis I'll consider applying!

So far, getting it to pretend to be Donald Trump made it say some pretty bad stuff about Taiwan:
