Key frame networks

Invention Grant

US12046227B2 Key frame networks (Active)

Patent Title: Key frame networks
Application No.: US17659840

Application Date: 2022-04-19
Publication No.: US12046227B2

Publication Date: 2024-07-23
Inventor: Tom Marius Kenter, Tobias Alexander Hawker, Robert Clark
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Honigman LLP
Agent: Brett A. Krueger; Grant Griffith
Main IPC: G10L13/08
IPC: G10L13/08; G10L15/02; G10L15/06; G10L15/187

Abstract:

A method for generating frame values using a key frame network includes receiving a text utterance having at least one phoneme and, for each respective phoneme of the at least one phoneme, predicting, using a predictive model, a fixed quantity of key frames. Each respective key frame of the fixed quantity of key frames includes a representation of a component of the respective phoneme. The method also includes generating, using the fixed quantity of key frames, a plurality of frame values. Here, each respective frame value of the plurality of frame values is representative of a fixed duration of audio.
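
Illustrative sketch (not part of the patent record): the two steps named in the abstract, predicting a fixed quantity of key frames per phoneme and then generating frame values from them, could look roughly like the Python below. The abstract does not say how frame values are derived from key frames, how many key frames are used, or where per-phoneme frame counts come from. The linear interpolation, the count of three key frames, the `frames_per_phoneme` input, and every function name here are therefore assumptions made only for illustration, and `toy_predict_key_frames` is a hypothetical stand-in for the predictive model.

```python
from typing import Callable, List, Sequence

# Assumed fixed quantity of key frames per phoneme; the abstract gives no number.
KEY_FRAMES_PER_PHONEME = 3


def toy_predict_key_frames(phoneme: str) -> List[float]:
    """Hypothetical stand-in for the predictive model.

    Returns KEY_FRAMES_PER_PHONEME values derived from the phoneme string.
    A real model would be learned; this one is only deterministic filler.
    """
    base = 100.0 + float(sum(ord(c) for c in phoneme) % 50)
    return [base, base + 10.0, base + 5.0]


def interpolate(key_frames: Sequence[float], n_frames: int) -> List[float]:
    """Expand key frames to n_frames values by linear interpolation.

    Linear interpolation is an assumption; the abstract only says frame
    values are generated "using" the key frames.
    """
    if n_frames <= 0:
        return []
    last = len(key_frames) - 1
    if last == 0 or n_frames == 1:
        return [key_frames[0]] * n_frames
    out = []
    for i in range(n_frames):
        pos = i * last / (n_frames - 1)   # position on the key-frame axis
        lo = min(int(pos), last - 1)      # left neighbouring key frame
        frac = pos - lo
        out.append(key_frames[lo] * (1.0 - frac) + key_frames[lo + 1] * frac)
    return out


def generate_frame_values(
    phonemes: Sequence[str],
    frames_per_phoneme: Sequence[int],
    predict: Callable[[str], List[float]] = toy_predict_key_frames,
) -> List[float]:
    """Concatenate per-phoneme frame values.

    Each output value stands for one fixed-duration slice of audio.
    frames_per_phoneme is an assumed input, not taken from the abstract.
    """
    frames: List[float] = []
    for phoneme, n_frames in zip(phonemes, frames_per_phoneme):
        frames.extend(interpolate(predict(phoneme), n_frames))
    return frames
```

For example, `generate_frame_values(["k", "ae", "t"], [4, 6, 3])` returns 13 values, one per fixed-duration frame, regardless of how the key frames were spaced inside each phoneme.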

Public/Granted literature

US20230335110A1 Key Frame Networks, Public/Granted day: 2023-10-19

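IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/08	.Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination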