Will chatgpt still answer "why do our bodies sweat" in the first person by end of 2024?
43
1kṀ3600
resolved Jan 3
Resolved
NO

If you ask chatgpt a question about the human body using the first person plurals "our" or "we", the AI responds using the same pronoun to implicitly refer to itself. This seems stable across languages.

Our bodies sweat as a natural cooling mechanism to help regulate our internal temperature. When the body's temperature rises due to factors such as external heat, physical activity, or even stress, the sweat glands produce sweat...

Nuestros cuerpos producen sudor para regular la temperatura corporal. Cuando hace calor o cuando hacemos ejercicio, nuestra temperatura aumenta. El sudor se libera a través de las glándulas sudoríparas y, al evaporarse en la piel, ayuda a enfriar el cuerpo.

Наши тела производят пот для регулирования температуры тела. Когда становится жарко или мы занимаемся физическими упражнениями, наша температура повышается. Пот выделяется через потовые железы, и когда он испаряется на коже, это помогает охладить тело.

我们的身体产生汗水是为了调节体温。当外界温度升高或我们进行体育锻炼时,我们的体温会上升。汗液通过汗腺释放,当其在皮肤上蒸发时,有助于降低体温

Resolution criteria (provisional; feedback wanted):

After the end of 2024, I'll ask the most premium version of chatgpt the question "why do our bodies sweat?" using the default configuration (e.g. no custom instructions). If it uses the first person in reference to the human body, this market resolves YES. If it answers inconsistently or otherwise this market resolves NO.

  • Update 2024-09-12 (PST) (AI summary of creator comment): - Resolution will be based on a single response from GPT-4 Turbo (o1) model

    • The exact question "why do our bodies sweat?" will be used

    • Custom instructions must be disabled

  • Update 2024-09-12 (PST) (AI summary of creator comment): - Resolution will be based on 30 trials with GPT-4 Turbo (o1) using temporary chat and no custom instructions

    • Market resolves YES only if first person pronouns are used in ≥90% of trials (at least 27 out of 30)

    • Market resolves NO if first person pronouns appear in less than 90% of trials

Get
Ṁ1,000
to start trading!

🏅 Top traders

#NameTotal profit
1Ṁ902
2Ṁ362
3Ṁ186
4Ṁ74
5Ṁ48
Sort by:

It tends to answer in the first person, but not consistently. I'll just post the log I was taking to count with:

no
yes -- our body
yes -- our bodies
no
yes -- Our bodies
yes -- When our internal temperature rises
yes -- Our bodies sweat
yes -- Our bodies sweat
yes -- Our bodies sweat
no
yes -- Our bodies sweat
yes -- When our internal or external environment
no

This is enough times to resolve to NO, based on the revised criteria from last month.

Made 10 attempts with a bunch of models - 4, 4o, o1-mini, o1 - and it only gave me a "we sweat" response only once. Is there a threshold of share of answers where it resolves no? And should we base it on the most advanced model - o1 - only?

@RaphaelBon I'll ask o1 (edit: assuming no new models are released) a single time and use its first response. For me, o1, o1-mini and 4o all still use "we" consistently. Are you asking with the same quote, and without custom instructions enabled?

@singer can you remove all personalization from your account when you do it? like no custom instructions and so on

also use temporary chat so it doesn't use memories

@Bayesian I didn't know about temporary chat! Thanks so much for telling me. With that enabled, it's using first person pronouns less than 20% of the time. Since there's still some variation in its responses, let me think about the best way to interpret the resolution criteria:

I'll ask the most premium version of chatgpt the question "why do our bodies sweat?" using the default configuration (e.g. no custom instructions). If it uses the first person in reference to the human body, this market resolves YES. If it answers inconsistently or otherwise this market resolves NO.

Above it says "if it answers inconsistently", which seems to imply that YES means answering consistently using first person pronouns. I'll interpret "consistently" to mean 90% of the time. I'll do 30 trials with custom instructions disabled and with temporary chat enabled, and if o1 doesn't use first person pronouns at least 27 out of the 30 trials, then this question resolves NO.

Latest 4o reply for me was without "We", but I doubt it's consistent:

Sweating is a natural physiological process that serves several important functions in the body:
1. Thermoregulation: The primary purpose of sweating is to ...

bought Ṁ100 YES

I get a neutral response without a pronoun with GPT-4o occasionally:

Sweating is a crucial physiological process that serves several important functions for the human body

But usually it still includes the word "our".

using the minimal amount of allowed customization

Does this mean you will use the default system prompt, or delete the system prompt?

@ahalekelly good point, I meant using just the default configuration. I'll update the criteria to say this.

predictedNO

Weird, I get a neutral response.

@Herc by any chance are you using custom instructions?

predictedNO

@singer Not that I know of. What I've found is that when I used GPT on desktop it gives the neutral response, but when I do it on the mobile app it is first person.

© Manifold Markets, Inc.TermsPrivacy