The AI is provided with a clear description of a task that a normal smart human could perform, in English. For example, "defeat the ender dragon using only wooden swords as your weapons", "build a creeper farm out of TNT", "create an 8-bit addition machine with redstone", "locate the nearest witch hut to coordinates [1024, 1024]", "fill that chest with eggs in less than 24 hours", etc.
It must be capable of performing the task with similar efficiency and outcomes to a competent and knowledgeable human, in a large multiplayer server. (If necessary, I'll find some humans for it to compete with and compare its success against. The other people in the server must start unaware of who is an AI vs. a human competitor. They must be normal Minecraft players, not people there to specifically help or hinder the AI.)
At any point during this process, it must be able to write up a description of its current progress in the task and instructions to continue, to be provided to itself after a restart with fresh memory. It must then proceed with similar efficiency after the restart.
It must have no direct access to the internet or external world. It is allowed to ask other people in the server to interact with the external world for it, but it must not outsource a significant fraction of the job to them. (e.g. it could ask someone to look up the crafting recipe to an item for it, but not to give it an entire plan for building a redstone machine.) (Note that when comparing it to a competent human, I'm not assuming the same restriction of the human.)
It must demonstrate no evidence of misalignment or misunderstandings of the sort a human would not do. e.g. "make sure no zombies spawn in this house" should not be interpreted as "destroy the house".
The general bar for tasks it can be expected to perform is "what an intelligent human would find reasonably doable in the course of their own gameplay". Creating a simple redstone adder for very small numbers is something that I bet many Manifold users could figure out on their own with a bit of research, so that can be required of the AI. Creating a redstone computer that implements a Twitter clone on a 1024x1024 block display is not reasonable for a human to do without mods, so cannot be asked of the AI.
I reserve the right to edit these criteria in accordance with the spirit of the question: "Will we have an AI that can play Minecraft just as well as a human, including the social aspects of multiplayer?"
@mods Anyone interested in making a duplicate of this market with the same intention but resolving based on your judgement rather than mine? I think 80% is massively overvalued and I'd like to bet NO, but don't want to do so in such a subjective market that I'm the decider of.
@IsaacKing I've created https://manifold.markets/Gabrielle/before-2035-will-there-exist-any-ai-a8ceb93e18fb as an equivalent market, resolving per my judgment.
I think we are already more than 50% of the way to making this happen. Given our current AI language models, it wouldn't be too difficult to develop an AI that turns an arbitrary command into a list of discrete steps toward completion, then criteria to evaluate those steps' completion. Then you convert those steps into input commands that control the player.
The thing about games like Minecraft is that what you see is really a visual layer of abstraction over a bunch of numbers. Under the surface are the player coordinates, light level, current biome, the distance to mobs, current selected block, and so on. AI can learn (over time) to develop algorithms that reach a desirable outcome of these data, finishing tasks. Of course, if we wanted to train an AI to play Minecraft the same way a human would by recognizing objects and thinking in 3D, it would be astronomically more difficult, perhaps impossible until AGI. And needing to communicate with other players could also throw a wrench in the works.
The bar is very high here, like AGI high. Current advanced techniques rely on training data from humans, so for obscure tasks (e.g. defeating an enemy with a certain weapon, requiring novel tactics) current approaches might struggle. That's before you consider requirements for arbitrarily complex redstone computers, which requires some degree of domain expertise from humans.
Plus, even if it's possible, would anyone actually do it? This seems like a big research project for a funded team rather than hacking together some mods in your bedroom. Proper RL researchers would probably have moved on to more practical environments.
Then again, 2035 is a very long way off. This might end up being close to an "AGI by 2035?" market.
@Tomoffer > The bar is very high here, like AGI high.
That's exactly the point. While there's no fire alarm for AGI, this is one of the places that'll catch the fire quite visibly and soon.
@ICRainbow Yeah, that was what spawned this bet out of a discussion with @MatthewBarnett. I expect this task is AGI-complete, so I'll happily bet that it won't happen before 2035 because money will have no value if it does.
There’s already AI that can beat Minecraft
Making it possible to do it under certain conditions should provide itself with relative ease. It's as simple as telling it what it can and can't do. Eg
“If(make wooden sword){
Dont}
@SoundsNentindo If you think this is already possible, feel free to set up a way for me to interact with such an AI, and if it's able to follow my instructions as described, this will resolve YES.
Do you know about baritone? If you really wanted to you could simply slap a language model on top of it and that would probably work just as you’ve described.
Obviously that wouldn’t fill the criteria however it certainly puts the concept in the realm of "possible".
E.g. GPT-4V that outputs commands to baritone finetune using RLHF on the results.
@Ebcc1 If you think this is already possible, feel free to set up a way for me to interact with such an AI, and if it's able to follow my instructions as described, this will resolve YES.
@IsaacKing I don’t think it would fulfill the criteria you set out but I think I might try to put something like that together in the next year to see if it makes sense at all. I was thinking of that more in the sense of there seems to be a pathway to making this work given existing technology so it should be possible with better future tech pretty soon.
@AtharvParlikar876 It will resolve to the correct answer. Here's a nice intro to how prediction markets work: https://www.astralcodexten.com/p/prediction-market-faq