Which organization will first release an open weights video generation model with fidelity as good as Sora?
29
162
1.2K
2025
31%
Meta
22%
Stability AI
14%
Mistral
9%
Other
7%
nvidia
5%
Google
2%
Alibaba
2%
Bytedance
2%
OpenAI
1.8%
Natural Synthetics Inc.
1.8%
Shanghai AI Lab
1.2%
ModelScope
1.1%
Huggingface

OpenAI has just announced a new text-to-video model known as Sora which has unprecedented visual quality and object permanence.

This question asks: what company will first release an open-weights text-to-video model (or image-to-video model) with fidelity equal to or greater than Sora.

In order to resolve this question positive the model must be open-weights, meaning anyone can download the model weights (possibly after signing a disclaimer), but need not be open-source. For example it could be research-only or restricted for commercial use.

Notable existing open weights video generation include:
Stable Video Diffusion: Stability AI
Hotshot XL: Natural Synthetics Inc.
Animate LCM: Shanghai AI Lab
I2VGen-XL: Alibaba
ByteDance: MagicAnimate
ModelScope: Modelscope text-to-video

(new answers can be added to this question)

Judgement of quality will be my personal judgement, unless OpenAI releases official scores (for example video FID) of Sora's performance. In order to resolve positive, a model must at a minimum: produce videos of length >=60s, demonstrate object-permanence, most of the time generated humans and animals have the correct number of arms/legs/fingers.

Get Ṁ200 play money
Sort by:

StabilityAI CEO announces Stable Diffusion 3

why tf is mistral so high

@ashly_webb secretly hoping the answer is "insider trading"

More related questions