@teortaxesTex' DeepSeek V4 predictions thread
7
1.9kṀ2261Jan 1
37%
>=1.5T parameters
59%
>=52B active parameters
61%
>=25T pretraining tokens
51%
uses some non-AdamW optimizer
35%
DS-MoE with adaptative expert count
41%
intra-expert communication
51%
>=512 experts
52%
>=16 active experts
59%
>= 2 shared experts
72%
Some variation of NSA (Native Sparse Attention)
49%
1M+ Context
22%
Gemini 2.5 Pro tier or higher on FictionBench (90.6%+ at 192k)
15%
>= 44% on Humanity's Last Exam (text only) at scale.com leaderboard
16%
>= 73% on SWE-Bench Verified (according to epoch.ai)
21%
>= 60% on BrowseComp (https://www.kaggle.com/benchmarks/openai/browsecomp)
29%
>= 50% on TerminalBench (https://www.tbench.ai/leaderboard)
39%
Some image input (multimodality)
20%
DeepSeek reports some results with a full-blown deep research agent, and emphasizes that this is the intended use-mode
Teortaxes gave some point estimates. These are not as amenable to prediction market forecasting so I turned them into over/under forecasts. I may add forecasts from other commenters in the thread later on, so these may not only be forecasts by Teo
See post for more (including forecasts I wasn't able to turn into market options):
This question is managed and resolved by Manifold.
Get
1,000 to start trading!
People are also trading
Did DeepSeek lie about the GPU compute budget they used in the training of v3?
12% chance
When will DeepSeek release V4?
When will Deepseek V4 be released?
1/15/26
When will DeepSeek release R2?
When will Deepseek R2 be released?
3/31/26
Will Deepseek release Deepseek V3.3?
31% chance
will deepseek-v4 destroy all other models?
15% chance
Will DeepSeek's next reasoning model be open-sourced?
83% chance
Will DeepSeek R2 be open source?
79% chance
Will DeepSeek's next reasoning model be called R3?
1% chance
Sort by:
@ookina_inu hmmm dunno the details enough to evaluate this. i'd default to asking teo maybe. if you know the details of both DSA and NSA and have an opinion one way or another lmk
@Bayesian Gotcha. I honestly think this could go either way. Seems sufficiently different from NSA to not literally be NSA, but plausibly could fit in “some variant of.” Will update if I form a stronger opinion
People are also trading
Related questions
Did DeepSeek lie about the GPU compute budget they used in the training of v3?
12% chance
When will DeepSeek release V4?
When will Deepseek V4 be released?
1/15/26
When will DeepSeek release R2?
When will Deepseek R2 be released?
3/31/26
Will Deepseek release Deepseek V3.3?
31% chance
will deepseek-v4 destroy all other models?
15% chance
Will DeepSeek's next reasoning model be open-sourced?
83% chance
Will DeepSeek R2 be open source?
79% chance
Will DeepSeek's next reasoning model be called R3?
1% chance