Using speech to text data in training text to speech models

Invention Grant

US11699430B2 Using speech to text data in training text to speech models 有权

Please log in to see more content

Patent Title: Using speech to text data in training text to speech models
Application No.: US17245048

Application Date: 2021-04-30
Publication No.: US11699430B2

Publication Date: 2023-07-11
Inventor: Andrew R. Freed , Vamshi Krishna Thotempudi , Sujatha B. Perepa
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent David K. Mattheis
Main IPC: G10L13/08
IPC: G10L13/08 ; G10L13/06 ; G06N20/00

Using speech to text data in training text to speech models

Abstract:

A system and method for providing a text to speech output by receiving user audio data, determining a user region-specific-pronunciation classification according to the audio data, determining text for a response to the user according to the audio data, identifying a portion from the text, where a region specific-pronunciation dictionary includes the portion, and using a phoneme string, from the dictionary selected according to the user region-specific pronunciation classification, for the word in a text to speech output to the user.

Public/Granted literature

US20220351715A1 USING SPEECH TO TEXT DATA IN TRAINING TEXT TO SPEECH MODELS Public/Granted day:2022-11-03

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/08	.文本分析或文本以外的语音合成参数的产生，例如语义图翻译为音素、韵律产生、重音或声调测定