News
The primary application of VASA-1 is aimed at the creation of virtual characters. The model excels in generating lip movements that align precisely with the accompanying audio.
VASA-1 takes in a single portrait photo and an audio file and converts it into a hyper realistic talking face video complete with lip sync, realistic facial features and head movement. The model ...
On Tuesday, Microsoft Research Asia unveiled VASA-1, an AI model that can create a synchronized animated video of a person talking or singing from a single photo and an existing audio track.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results