Will Model Intelligence diverge from consumer market share?

Question

Context:

Currently, it looks like in raw intelligence

Gemini 3.1 Pro (Google) > GPT-5.4 (OpenAI) > Claude Opus 4.6 (Anthropic)

yet in terms of market share / usability

Anthropic (Claude Code) > OpenAI (Codex) > Google (Antigravity / Gemini CLI)

this suggests there’s an inherent tradeoff between effort put into improving the underlying model and the integration with the harness

where Google (and OpenAI) focus more on  tasks like multimodality (image / video gen) long context, math, chess and hope the other capabilities come down stream of that increased intelligence

Anthropic is much more focused on coding / SWE and the tools (MCP / Skills / Claude Code) to enable training data specific to the domain they’re hill climbing on, they’ve also made Claude much more humanlike in behavior with things like soul documents and Claude seems to have by far the best non-reasoning models as well and generally more jailbreak resistant (Google is by far worst at that)

Context:

Recent interview with OpenAI Chief Scientist:

https://youtu.be/vK1qEF3a3WM?si=Mc9hOdHrHjO9msII

[image]Anthropic:

https://x.com/rohanpaul_ai/status/2043008591242891315?s=20

[image]Gaining market share rapidly amid DoD drama, reportedly increased from 9B ARR run rate last year to 30B just 3 months into 2026 (https://finance.yahoo.com/news/anthropic-tops-30-billion-run-221045473.html )

Already poised to overtake OpenAI in valuation (918B vs 868B on ventuals)

https://manifold.markets/EricNeyman/when-will-anthropic-surpass-openai

Mythos seems like a sign Anthropic might have taken the lead once and for all at least in coding, but too early to tell whether the lead in coding translates to other domains they’re clearly behind in (e.g. math, world models)

Google:

Google is definitely focusing more on “Personal Intelligence” and their models historical suck at agentic coding due to a tendency to attend to all the context at once which makes it require hacks to be able to attempt to use tool calls for long running tasks

https://youtu.be/TilDSWeiAlw?si=hBFl8-Wm2bSrsGqy

[image]Relevant benchmarks

ECI (and all the specific benchmarks going into that, namely seems like Frontier math and chess are what make OpenAI and Google ahead of Anthropic) https://epoch.ai/eci/?view=graph&tab=release-date

[image]OpenAI: 156/158

Google: 157

Anthropic: 155

AA Intelligence (another aggregator focusing on one shot tasks which where Google and OpenAI seem to perform better): https://artificialanalysis.ai

[image]Google: 57

OpenAI: 57

Anthropic: 53

Benchmarks that favor Anthropic:

Versions of SWE bench that focus on long running / code churn:

can’t seem to find the exact link - https://www.lesswrong.com/posts/nAMhbz5sfpcynjPP5/swe-bench-pro-is-even-worse, I saw in some recent video

maybe it was evoclaw: https://arxiv.org/pdf/2603.13428

or actually SWE-CI was probably what I was looking for: https://arxiv.org/pdf/2603.03823

[image]Arena.ai: https://arena.ai/leaderboard/text

Ok enough background

will resolve in a year with possibility to extend to another year if it’s unclear (kind of like now)

Yes = the trend is proven correct where Anthropic seems to pull ahead due to sole focus on code / enterprise consumer whereas OpenAI and Google’s focus on general intelligence / research / frontier causes them to lose market share to Anthropic who achieve some kind of special sauce focusing on their ecosystem. Essentially yes = in the next 1-2 years Anthropic seems to have some moat (kinda like Apples walled garden ecosystem vs Android) even though benchmarks seem to say other models are more intelligent (hopefully the inverse Goodharts doesn’t apply e.g. the benchmarks are made to match the subjective opinion that Anthropic models feel much better)

No = everything I said above is largely irrelevant and there’s no such divergence

N/A / unclear is if we’re still in some state of relative ambiguity like we are now and no such conclusion can be made

To recap the main motivation for this market is that I initially was interested in whether OpenAI willfail / collapse / have its bubble pop based on recent news like their 1.4T investment commitment being lowered to 600B e.g. if / when they fall out of the top 3 but I don’t think that’s very likely short term particularly with their lead in math research, so this market is more focused on the divergence between the “Intelligence” and how that translates into market share / revenue in the next 1-2 years

Manifold Markets · Answer

Per Manifold Markets prediction market, Yes, followed by N/A (unclear) and No are most likely. See the market for live updates (6 traders, as of Jun 3, 2026).

People are also trading

Related questions