OpenAI unveils powerful, creepy new text-to-video generator that it calls ‘a foundation for models that can understand and simulate the real world’

Image for OpenAI unveils powerful, creepy new text-to-video generator that it calls


The generative AI company behind ChatGPT and DALL-E has a new toy: Sora, a text-to-video model that can (sometimes) generate pretty convincing 60-second clips from prompts like “a stylish woman walks down a Tokyo street…” and “a movie trailer featuring the adventures of the 30 year old space man wearing a red wool knitted motorcycle helmet…”

A lot of the AI video generation we’ve seen so far fails to sustain a consistent reality, redesigning faces and clothing and objects from one frame to the next. Sora, however, “understands not only what the user has asked for in the prompt, but also how those things exist in the physical world,” says OpenAI in its announcement post (using the word “understands” loosely).

View post on imgur.com”


Source link