Invention Grant
- Patent Title: Using machine-learning models to determine movements of a mouth corresponding to live speech
- Application No.: US16016418
- Application Date: 2018-06-22
- Publication No.: US10699705B2
- Publication Date: 2020-06-30
- Inventor: Wilmot Li, Jovan Popovic, Deepali Aneja, David Simons
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: Adobe Inc.
- Current Assignee: Adobe Inc.
- Current Assignee Address: US CA San Jose
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G10L15/197
- IPC: G10L15/197; G06N3/04; G06N3/08; G10L15/02; G10L15/06; G10L21/0316; G10L25/21; G10L25/24

Abstract:
Disclosed systems and methods predict visemes from an audio sequence. A viseme-generation application accesses a first set of training data that includes a first audio sequence representing a sentence spoken by a first speaker and a sequence of visemes. Each viseme is mapped to a respective audio sample of the first audio sequence. The viseme-generation application creates a second set of training data by adjusting a second audio sequence, in which a second speaker speaks the same sentence, such that the first and second sequences have the same length and at least one phoneme occurs at the same time stamp in both sequences. The viseme-generation application maps the sequence of visemes to the second audio sequence and trains a viseme prediction model to predict a sequence of visemes from an audio sequence.
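
The augmentation step described in the abstract can be illustrated with a small sketch. The abstract does not name the alignment technique, so the use of dynamic time warping (DTW) here is an assumption, as are the function names `dtw_path` and `warp_to_reference`, the toy one-value-per-frame features, and the viseme labels. The sketch warps the second speaker's sequence onto the first speaker's timeline so that both have the same length, then reuses the first speaker's viseme labels for the warped sequence.

```python
def dtw_path(ref, other):
    """Return a DTW alignment path [(i, j), ...] between two 1-D sequences."""
    n, m = len(ref), len(other)
    inf = float("inf")
    # cost[i][j] = minimal cumulative distance aligning ref[:i] with other[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - other[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end of both sequences to the start.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),  # diagonal
            (cost[i - 1][j], i - 1, j),          # advance ref only
            (cost[i][j - 1], i, j - 1),          # advance other only
        ]
        _, i, j = min(steps)
    path.reverse()
    return path


def warp_to_reference(ref, other):
    """Resample `other` so it has one frame per frame of `ref`."""
    warped = [None] * len(ref)
    for i, j in dtw_path(ref, other):
        if warped[i] is None:  # keep the first frame aligned to each ref frame
            warped[i] = other[j]
    return warped


# Toy per-frame features (hypothetical): 0 = silence, 1 = voiced.
first_audio = [0, 0, 1, 1, 0]
first_visemes = ["Sil", "Sil", "Ah", "Ah", "Sil"]  # one label per frame
second_audio = [0, 1, 1, 1, 0, 0]                  # same sentence, different timing

warped = warp_to_reference(first_audio, second_audio)
# The warped sequence shares the first sequence's timeline, so the labels carry over.
second_training_set = list(zip(warped, first_visemes))
print(second_training_set)
```

A real pipeline would align multi-dimensional audio features rather than single values per frame, but the label-transfer idea is the same: once both sequences share a timeline, each viseme label of the first sequence applies to the frame at the same index in the warped second sequence.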
Public/Granted literature
- US20190392823A1 USING MACHINE-LEARNING MODELS TO DETERMINE MOVEMENTS OF A MOUTH CORRESPONDING TO LIVE SPEECH Public/Granted day: 2019-12-26