Microsoft’s VASA-1 takes AI-generated video one step closer to ‘aw hell, we’re all doomed’

A promotional image for Microsoft


With generative AI now a key feature of all its new software and hardware projects, it should be no surprise that Microsoft has been developing its own machine learning models. VASA-1 is one such example: it takes a single image of a person and an audio track and converts them into a convincing video clip of that person speaking the recording.

Just a few years ago, anything created via generative AI was instantly identifiable from a handful of giveaways. In still images, it would be things like the wrong number of fingers on a person’s hand, or even something as simple as the wrong number of legs. AI-generated video was even worse, but at least it was very meme-worthy.


