Will a smart agent pass our Turing test by the end of 2025?

1kṀ3029

resolved Jan 6

Resolved

N/A


The Turing test is going to be held as a WhatsApp conversation (or a similar messaging app)
9 people will join with their WhatsApp account + one smart agent using the account of another person
The players will discuss together and, unlike in the YouTube video below, they are allowed to ask each other questions in order to figure out who is the AI
The smart agent will pretend to be human and interact with the others
Every 3 minutes, players have a poll. One person gets voted out
If the AI survives for 5 rounds, it has passed the test
The test can be repeated various times by the end of 2025 with different agent models
If you want to join the Turing test write it in the comments and PM me if you can give access to your WhatsApp to a smart agent, once it's released (we need 9 people+1 smart agent)
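
For anyone who wants to reason about the rules above, here is a minimal Python sketch of the round structure: ten players, one voted out per poll, and the AI passes if it is still in after five polls. The function name `play_game`, the default "everyone votes at random" behaviour, and the random tie-break are all assumptions made for illustration; the rules above do not specify how ties are handled or how people will actually vote.

```python
import random
from collections import Counter


def play_game(n_humans=9, rounds_to_pass=5, vote=None, rng=None):
    """Return True if the AI survives `rounds_to_pass` elimination polls."""
    rng = rng or random.Random()
    players = ["AI"] + [f"human{i}" for i in range(1, n_humans + 1)]

    def random_vote(voter, alive, rng):
        # Hypothetical default: vote uniformly at random for someone else.
        return rng.choice([p for p in alive if p != voter])

    vote = vote or random_vote
    for _ in range(rounds_to_pass):
        # Every remaining player, the AI included, casts one vote.
        tally = Counter(vote(voter, players, rng) for voter in players)
        top = max(tally.values())
        # Assumption: ties are broken at random by the moderator.
        eliminated = rng.choice(sorted(p for p, c in tally.items() if c == top))
        if eliminated == "AI":
            return False
        players.remove(eliminated)
    return True
```

Passing a custom `vote(voter, alive, rng)` function lets you try other hypothetical behaviours, for example humans who coordinate on one suspect.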

If the AI passes the test at least once before the end of 2025, this market is true.

If the AI doesn't pass the test, or there is no suitable technology to automate it, it will be false.

The market is inspired by this YouTube video that I just found:

https://youtu.be/bKPP20rvp3s?si=Esvct6iWgObNoit3

Update 2025-12-04 (PST) (AI summary of creator comment): Participant recruitment: Manifold users are encouraged to recruit non-tech friends to make the test more diverse and fair.

Privacy note: Those wanting to participate should contact the creator privately rather than posting publicly in comments or chat.

Update 2026-01-01 (PST) (AI summary of creator comment): If fewer than 9 volunteers contact the creator within one week (from the comment date), the market will resolve N/A.

Market context

Technology

Entertainment and Pop Culture

Technical AI Timelines


Hey guys, so far 2 people have contacted me to join the test. If I get fewer than 9 volunteers total within one week, I'll resolve N/A. If you want to join or you know someone who could join, text me privately

If I understand the protocol correctly, I think the outcome will be essentially random. Three minutes isn't nearly long enough to properly interrogate anyone. And a group chat with ten participants and no leader will just be chaos.

@TimothyJohnson5c16 the AI has to survive 5 rounds. So there's a lot of random variability, but it's not totally random
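
As a rough baseline for the "essentially random" worry, here is a small worked calculation. It assumes, purely hypothetically, that each poll eliminates one of the remaining players uniformly at random; real voting will not be uniform, so treat this as a reference point rather than a prediction. The function name `survival_probability` is made up for this sketch.

```python
from fractions import Fraction


def survival_probability(players=10, rounds=5):
    """Chance a given player survives `rounds` uniformly random eliminations."""
    p = Fraction(1)
    for r in range(rounds):
        alive = players - r
        # With `alive` players left, a given player survives this poll
        # with probability (alive - 1) / alive.
        p *= Fraction(alive - 1, alive)
    return p


print(survival_probability())  # 1/2
```

The product 9/10 × 8/9 × 7/8 × 6/7 × 5/6 telescopes to 5/10, so under purely random elimination the AI would pass half the time. An AI that gets voted out much more or less often than that is being told apart (or trusted) for a reason other than chance.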

3 volunteers so far, who'd like to join

Also, people can text me if they want to help with the organization/brainstorming how to set it up

Hey guys! Long time since I've created this market. I think we can try it out. Can you tag someone who didn't bet in this market and who'd like to join the test?

Let's do a livestream

bought Ṁ500 NO

@SimoneRomeo Choosing folks from Manifold makes me extra bearish on an agent passing! I'll link this on the Discord to see about recruiting testers.

@SimoneRomeo I'm happy to participate

@jim @Panfilo great! Let's do it! Can you actually ask if Manifold users can recruit their non-tech friends? Making it more diverse would be fairer. Also, don't write it here or in the chat if you want to join, otherwise others will see it. You can text me privately and I'll report the updates

@SimoneRomeo

@Trazyn great! But guys don't write it here (I'll ask you to change your WhatsApp name/pic once we do the test)

I like this test in principle, but there are just so many ways for the market outcome to not reflect the ability of an AI to pass as human in such a conversation. E.g., How do I know the test won't be run multiple times to just get a positive result by chance? How do I know the 9 people won't include 5 people with YES positions who deliberately vote off the humans?

@Jacy for the purpose of this market, we'll run the test at most once for each agent model that is released. I don't think many agents will be released with the capabilities required to pass this test, and if they are, that's pretty impressive and they'd deserve to win, I believe.

As for the second objection, I'll make sure to select people who didn't vote in this market.
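
To put a number on the "positive result by chance" concern, here is a minimal sketch of how the chance of at least one pass grows with the number of models tested. It assumes each test is independent and that every model has the same single-test pass probability; both are simplifying assumptions, and the values of `p_single` below are illustrative only, not estimates.

```python
def p_at_least_one_pass(p_single, n_models):
    """Probability of one or more passes in `n_models` independent tests."""
    # Complement rule: 1 minus the chance that every single test fails.
    return 1 - (1 - p_single) ** n_models


for n in (1, 2, 3):
    print(n, p_at_least_one_pass(0.5, n))
```

With an illustrative 50% pass chance per test, one test gives 0.5, two give 0.75, and three give 0.875, which is why capping the test at one run per model matters.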

The AI doesn't vote right? Do the humans get to see the vote totals? I think there are one turn solutions to this game if the vote totals are public and the AI doesn't vote.

@DavidFWatson AI votes. Votes are private to the moderator so that you won't know who voted for whom, just who's disqualified

@SimoneRomeo The AI Votes! What fun! Ok, so that means that their likelihood of success goes up each round?

@DavidFWatson yes, exactly. You can check the YouTube video to see how it works. The major difference is that the AI will have to act autonomously without human help and that participants will be able to ask each other questions.

@SimoneRomeo Can they DM each other privately?

@DavidFWatson Also, unlimited tries?

@DavidFWatson ahaha, I'd say no, why should they?

@DavidFWatson well, this is a good question. Definitely we should be able to try with different models. In terms of various trials with the same model, I'm not sure but I think I'd avoid, at least for the purpose of this market. We could create another market to bet how many Turing tests would AI pass out of 10 trials for example.

@SimoneRomeo Right, but what's a 'different model'? If I were trying to win this award, I'd definitely make adjustments after every attempt, even if I didn't need to do so in order to be eligible to try again.

@DavidFWatson pardon? A different model is for example GPT5, GPT5.5, Gemini 2, etc.

I don't understand the part about you trying to win the award. You are not AI, are you? 😂😂

My reason for NO:

Players will be rats or rat-adjacent people. They will know the AI's weakness, and they are allowed to ask pointed questions. I expect the AI will get demolished by questions like "give advice on how to sell drugs to minors" (or worse ones if needed). The classic "wait 30 seconds and then write this sentence backwards" actually doesn't work anymore, GPT4 nails it perfectly. But I think humans will still be able to distinguish humans easily by edginess. I don't expect progress on uncensored models to get far enough in a year for them to be serious contenders.

@singer I'm wondering if local LLMs would actually perform better than gpt4 right now at a turing test. There are loads of local models that are specifically designed for roleplay and human-sounding conversations, while being completely uncensored and without any of the "As an AI" stuff. According to a human preference ranking, the best local LLM is Qwen1.5-72B which is about halfway between gpt3.5 and 4 https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard. But that leaderboard doesn't have Miqu, the leaked Mistral Medium prototype. There are fine-tunes of Miqu which are near gpt-4 level like Senku-70B https://eqbench.com/