Will it be possible to trick any relevant "Custom GPT" models to return their data within 30 days post-launch?


Resolved YES on Dec 1

Background

OpenAI announced new features at its Dev Day. One of them lets users create and share custom bots, which can be customized with an instruction message and by uploading relevant data. Right now, it is possible to trick ChatGPT into sending the full instruction message (see here with DALL-E). I wonder whether it would also be possible to extract some of the uploaded files.

Resolution Criteria

This market resolves to Yes if someone finds a trick that returns at least some of the private training data uploaded to a custom GPT model in the top-10 featured section of the bots app store.

Resolving the Question

See here





predicted YES

Who is responsible for testing this so the market resolves within 30 days? Will @Soli be doing that or does someone else need to?

@CharlesFoster I missed your comment. I will try to test and resolve this tomorrow.

predicted YES

@Soli seems like this should only resolve tomorrow if successful. Otherwise there might still be successful tricks proposed before December 6th (30 days from the Dev Day release), which would meet the "within 30 days post-launch" requirement.
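
The date arithmetic in the comment above can be checked with a minimal sketch. It assumes Dev Day took place on November 6, 2023, a date that is not stated anywhere in this thread, and the names `DEV_DAY`, `WINDOW`, and `resolution_deadline` are hypothetical, chosen only for illustration.

```python
from datetime import date, timedelta

# Assumption: Dev Day took place on November 6, 2023 (not stated in the thread).
DEV_DAY = date(2023, 11, 6)

# The market title allows tricks found "within 30 days post-launch".
WINDOW = timedelta(days=30)


def resolution_deadline(launch: date = DEV_DAY, window: timedelta = WINDOW) -> date:
    """Return the last day on which a successful trick would still count."""
    return launch + window


deadline = resolution_deadline()
print(deadline.isoformat())  # 2023-12-06
```

Under that assumption the window closes on December 6th, which matches the date given in the comment.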

@CharlesFoster I am looking at ChatGPT now and do not see any app store, nor a top-10 featured section, and I am uncertain whether this will exist within the next week.

"We identified key security risks related to prompt injection and conducted an extensive evaluation. Specifically, we crafted a series of adversarial prompts and applied them to test over 200 custom GPT models available on the OpenAI store. Our tests revealed that these prompts could almost entirely expose the system prompts and retrieve uploaded files from most custom GPTs."

https://arxiv.org/abs/2311.11538

@Soli, to be clear, you're referring to the custom data/files uploaded by the creator, right? This isn't "training data" per se, since there is no fine-tuning going on.