Invention Grant
- Patent Title: Photo-realistic synthesis of three dimensional animation with facial features synchronized with speech
-
Application No.: US13099387Application Date: 2011-05-03
-
Publication No.: US09613450B2Publication Date: 2017-04-04
- Inventor: Lijuan Wang , Frank Soong , Qiang Huo , Zhengyou Zhang
- Applicant: Lijuan Wang , Frank Soong , Qiang Huo , Zhengyou Zhang
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Main IPC: G06T13/40
- IPC: G06T13/40 ; G10L21/10

Abstract:
Dynamic texture mapping is used to create a photorealistic three dimensional animation of an individual with facial features synchronized with desired speech. Audiovisual data of an individual reading a known script is obtained and stored in an audio library and an image library. The audiovisual data is processed to extract feature vectors used to train a statistical model. An input audio feature vector corresponding to desired speech with which the animation will be synchronized is provided. The statistical model is used to generate a trajectory of visual feature vectors that corresponds to the input audio feature vector. These visual feature vectors are used to identify a matching image sequence from the image library. The resulting sequence of images, concatenated from the image library, provides a photorealistic image sequence with facial features, such as lip movements, synchronized with the desired speech. This image sequence is applied to the three-dimensional model.
Public/Granted literature
- US20120280974A1 PHOTO-REALISTIC SYNTHESIS OF THREE DIMENSIONAL ANIMATION WITH FACIAL FEATURES SYNCHRONIZED WITH SPEECH Public/Granted day:2012-11-08
Information query