Resolved NO on Dec 20, 2024 (Manifold Markets prediction market).

Will we get a video of Claude 3.5 Sonnet running a very single-minded, competent Minecraft agent before December 2024?

As repligate describes here:

Update 2024-12-20 (PST): Market will resolve YES if a video demonstrates Claude 3.5v2 Sonnet (not just 3.5 Sonnet) running a single-minded, competent Minecraft agent (AI summary of creator comment).

🏅 Top traders

#	Trader	Total profit
1		Ṁ1,347
2		Ṁ963
3		Ṁ688
4		Ṁ267
5		Ṁ253

I'm moderately confident that this behaviour does exist, but I haven't seen such a video and the videos in the comments do not meet my bar as far as I have seen.

I'm not aware of any videos showcasing Claude acting as competently as described in janus' post. The agents mostly don't seem to be good enough at Minecraft to act that way currently, but I can't rule out that it's simply a matter of janus-tier prompting skills.

bought Ṁ250 YES

@bence @NathanpmYoung how close does this come to resolving?

If it's sonnet 3.5v2 how does it resolve?

@JanCzechowski That would very likely resolve yes.

opened a Ṁ3,000 NO at 25% order

It's possible that this will get resolved based off a technicality - i.e. a video does get posted but without proof of it being executed by Claude. Otherwise a pretty strong No - the first rule of Twitter is that any viral tweet without irrefutable proof in the thread is at least a strong exaggeration.

Is this a new version of sonnet 3.5? Otherwise I'm confused - couldn't anybody reproduce this?

Arb? https://manifold.markets/AdamK/will-an-ai-minecraft-agent-defeat-t-a3b3eb99c337

bought Ṁ50 YES

https://x.com/AlkahestMu/status/1847516975767179397

@NathanpmYoung does this need to be like... verified or backed up in some way that it's actually just Claude 3.5 sonnet doing this, without human or other aid? Or would this resolve YES if repligate or some other user just releases a video they claim is of this?

Here's a video from maybe that same server: https://x.com/adonis_singh/status/1847707429066158546

This struck me as a little too good to be true when I saw it on twitter.

Not sure I'd call what I see in this video competent agents, and there seems to be some hand-holding from the creators, but these bots seem to manage to play the game okay: https://www.youtube.com/watch?v=1Sf437NKUPs

Still not clear to me how much is handled by the LLMs vs the other tools, since it seems that things like combat happen too fast for an LLM to react.

Title says "claude 3.5 opus" but the tweet is telling a story about sonnet being a competent Minecraft agent and opus just chatting. Is the title going to be fixed?

@MichaelEdgar Fixed
