Invention Grant
- Patent Title: Generating acoustic sequences via neural networks using combined prosody info
-
Application No.: US16568289Application Date: 2019-09-12
-
Publication No.: US11322135B2Publication Date: 2022-05-03
- Inventor: Vyacheslav Shechtman
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Barry D. Blount
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L15/18 ; G06N3/04 ; G10L15/16

Abstract:
An example system includes a processor to receive a linguistic sequence and a prosody info offset. The processor can generate, via a trained prosody info predictor, combined prosody info including a number of observations based on the linguistic sequence. The number of observations include linear combinations of statistical measures evaluating a prosodic component over a predetermined period of time. The processor can generate, via a trained neural network, an acoustic sequence based on the combined prosody info, the prosody info offset, and the linguistic sequence.
Public/Granted literature
- US20210082408A1 GENERATING ACOUSTIC SEQUENCES VIA NEURAL NETWORKS USING COMBINED PROSODY INFO Public/Granted day:2021-03-18
Information query