When will a robot reliably pass "The Coffee Test"?
Before 2025
Before 2026
Before 2027
Before 2028
Before 2029
Before 2030

"The Coffee Test" is an alternative to Turing test proposed by Steve Wozniak. It goes as follows:

A machine is required to enter an average American home and figure out how to make coffee: find the coffee machine, find the coffee, add water, find a mug, and brew the coffee by pushing the proper buttons. This has not yet been completed.

The test has to be repeated several times under independent control. (I.e. self-published video by Boston Dynamics is not enough.) The success rate has to be at least 50% out of at least 3 attempts in different houses.

The market will be resolved positively after a month has passed since the successful demonstration to give some time to uncover possible cheating.

Related question with a different time scale:


I do not bet on my own questions.

Get Ṁ600 play money
Sort by:

Robot enters home, uses it’s built in coffee machine to make a cup of coffee. Easy.

bought Ṁ70 Before 2026 NO

This feels to me similar to the challenge of self-driving cars: Theoretically possible long before practically possible.

Interestingly, it's very likely that a capable robot will exist for a while before this market closes. This test might not be well known enough for someone to bother to run it.

@ProjectVictory The test is relatively well-known, so if the robot exists as more than a single prototype, I expect somebody to try it out. They might only try just one house, which will not meet the criteria, but then we can wait for somebody else to repeat the test.

@ProjectVictory If you think that such robot already exists and it fully meets the market criterias, you should bet on "Before 2025" and take a profit when it confirmed.

@bessarabov To be fair, it has to not just exist, but to be tested in a particular way. Just the existence of a robot capable of doing it is not enough for the resolution.

Test from 14 years ago:

Some of the more modern coffee machines are quite simple to operate.

For example, does the above require the ability to manage an inexpensive italian press?

The intuition behind the idea was less 'do this specific task' and more 'generically be wise about the world'.

@gpt_news_headlines I would say an "average American home" has either a drip machine or a Keurig.

@gpt_news_headlines Multimodal models are already generically wise about the world in this way. The obstacle is dexterity, and that's all.

@HarrisonNathan It's an interesting point. If it's an issue of dexterity than perhaps there is a test which passes that which might be more relevant.

bought Ṁ3 Before 2030 NO

@HarrisonNathan can you name one such model? Not trying to argue, genuinely interested.

@ProjectVictory There are no models which are wise about the world to a particularly deep resolution. It's more like, ok, I know how to make coffee, I know about the coffee pot, the filter, where it goes, I need to put coffee grounds in it, put water in, and how much, etc. Fairly high level wisdom.

I mean, try out claude and gpt4. They can practically do all of this.

What there needs to be is a dextrous robot which has deep object recognition capability and manipulation. That part is missing. There is some connective models between the LLM layer and the robot layer, but unclear what's the best approach.

As a human (probably), I don’t know if I could pass the coffee test

@Vik Well, AIs could also play Chess and speak various languages better than us.

More related questions