Generating acoustic sequences via neural networks using combined prosody info

Invention Grant

US11322135B2 Generating acoustic sequences via neural networks using combined prosody info (Active)

Patent Title: Generating acoustic sequences via neural networks using combined prosody info
Application No.: US16568289

Application Date: 2019-09-12
Publication No.: US11322135B2

Publication Date: 2022-05-03
Inventor: Vyacheslav Shechtman
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent: Barry D. Blount
Main IPC: G10L15/00
IPC: G10L15/00 ; G10L15/18 ; G06N3/04 ; G10L15/16

Abstract:

An example system includes a processor to receive a linguistic sequence and a prosody info offset. The processor can generate, via a trained prosody info predictor, combined prosody info including a number of observations based on the linguistic sequence. The number of observations include linear combinations of statistical measures evaluating a prosodic component over a predetermined period of time. The processor can generate, via a trained neural network, an acoustic sequence based on the combined prosody info, the prosody info offset, and the linguistic sequence.
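The notion of an observation in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the statistical measures are the mean and the population standard deviation of a prosodic component (for example log-pitch) over each window, that the weights are fixed by hand, and that the prosody info offset is applied additively. None of these choices is stated in the abstract, and all function names are hypothetical. The trained prosody info predictor and the trained neural network are not modeled here.

```python
from statistics import mean, pstdev


def window_observation(values, weights):
    """Return one observation for one window of a prosodic component.

    Assumption: the statistical measures are the mean and the population
    standard deviation; the abstract does not name specific measures.
    """
    measures = [mean(values), pstdev(values)]
    # Linear combination of the statistical measures.
    return sum(w * m for w, m in zip(weights, measures))


def combined_prosody_info(windows, weight_rows):
    """Build one row of observations per window.

    Each entry of weight_rows defines one linear combination, so every
    window yields len(weight_rows) observations.
    """
    return [
        [window_observation(window, weights) for weights in weight_rows]
        for window in windows
    ]


def apply_offset(prosody_info, offset):
    """Shift every observation by the matching offset component.

    Assumption: the offset is additive; the abstract only says the
    acoustic sequence is based on both the prosody info and the offset.
    """
    return [
        [obs + off for obs, off in zip(row, offset)]
        for row in prosody_info
    ]


if __name__ == "__main__":
    # Two hypothetical windows of a prosodic component (arbitrary units).
    windows = [[1.0, 3.0], [2.0, 2.0]]
    # First observation is the mean, second is the spread.
    weight_rows = [[1.0, 0.0], [0.0, 1.0]]
    info = combined_prosody_info(windows, weight_rows)
    print(apply_offset(info, [0.5, -0.5]))
```

In the system the abstract describes, a trained predictor would produce such observations from the linguistic sequence, and the neural network would then receive them together with the offset and the linguistic sequence.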

Public/Granted literature

US20210082408A1 GENERATING ACOUSTIC SEQUENCES VIA NEURAL NETWORKS USING COMBINED PROSODY INFO Public/Granted day: 2021-03-18

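IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)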