Will Deepseek V4 outperform OpenAI and Anthropic models at coding?
10
100Ṁ199
Dec 31
17%
chance

Claim: https://x.com/petergostev/status/2009616928763981963

Will Deepseek V4 outperform OpenAI's and Anthropic's strongest contemporary models at the time of its release?

Relevant coding benchmarks:

  • SWE-bench Verified

  • HumanEval

  • TerminalBench

  • RE-Bench

  • LiveCodeBench

Deepseek V4 must score higher than both OpenAI's and Anthropic's strongest latest released models on 3/5 of these benchmarks (official or independent benchmark results) to resolve YES. If V4 matches or underperforms either of its competitors on more than half of those benchmarks, it resolves NO. If a certain benchmark is not reported within 1 month of release, that benchmark counts as a loss for Deepseek V4.

Market context
Get
Ṁ1,000
to start trading!
© Manifold Markets, Inc.TermsPrivacy