Apparatus and method for generating lip sync image

Invention Grant

US12190903B2 Apparatus and method for generating lip sync image 有权

Please log in to see more content

Patent Title: Apparatus and method for generating lip sync image
Application No.: US17764324

Application Date: 2021-06-03
Publication No.: US12190903B2

Publication Date: 2025-01-07
Inventor: Guem Buel Hwang , Gyeong Su Chae
Applicant: DEEPBRAIN AI INC.
Applicant Address: KR Seoul
Assignee: DEEPBRAIN AI INC.
Current Assignee: DEEPBRAIN AI INC.
Current Assignee Address: KR Seoul
Agency: The PL Law Group, PLLC
Priority: KR10-2020-0172024 20201210
International Application: PCT/KR2021/006913 WO 20210603
International Announcement: WO2022/124498 WO 20220616
Main IPC: G10L21/10
IPC: G10L21/10 ; G06T13/40 ; G06T13/80

Apparatus and method for generating lip sync image

Abstract:

An apparatus for generating a lip sync image according to a disclosed embodiment has one or more processors and a memory which stores one or more programs executed by the one or more processors. The apparatus includes a first artificial neural network model configured to generate an utterance synthesis image by using a person background image and an utterance audio signal corresponding to the person background image as an input, and generate a silence synthesis image by using only the person background image as an input, and a second artificial neural network model configured to output, from a preset utterance maintenance image and the first artificial neural network model, classification values for the preset utterance maintenance image and the silence synthesis image by using the silence synthesis image as an input.

Public/Granted literature

US20230178095A1 APPARATUS AND METHOD FOR GENERATING LIP SYNC IMAGE Public/Granted day:2023-06-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L21/00	为了改变语音或声音信号的质量或其可识度而处理语音或声音信号，以产生另一种可听的或非可听的信号，例如视觉信号或触觉信号（G10L19/00优先）
G10L21/06	.将语音转换成非可听表达形式，例如语音可视化、触觉辅助的语音处理（G10L15/26优先）
G10L21/10	..转换成可视信息