
Minimum video length of 2 minutes, and must maintain coherence. The visuals, dialogue, and sound must all be of "reasonable" quality: it does not need to be indistinguishable from human made video, but there shouldn't be significant artifacts.
Is this from a prompt of like "A video of two people having a conversation", or something with more input data, such as the transcript of the dialogue and a starting picture of two people talking to each other?

@vluzko https://www.youtube.com/watch?v=jz78fSnBG0s In what ways does this not pass the test? Because of the video creator splicing the clips together?

@Nikola The splicing hurts it, but the main thing is that this is a question about being able to generate many kinds of video, not any video. Think DALL-E 2 but for video with sound (although I do not require the inputs to be purely text)














Related markets


Related markets

