System and method for automatic prediction of speech suitability for statistical modeling

Invention Grant

US09484045B2 System and method for automatic prediction of speech suitability for statistical modeling 有权

Title translation: 自动预测语音适用性的统计建模系统和方法

Please log in to see more content

Patent Title: System and method for automatic prediction of speech suitability for statistical modeling
Patent Title (中): 自动预测语音适用性的统计建模系统和方法
Application No.: US13606618

Application Date: 2012-09-07
Publication No.: US09484045B2

Publication Date: 2016-11-01
Inventor: Alexander Sorin , Slava Shechtman , Vincent Pollet
Applicant: Alexander Sorin , Slava Shechtman , Vincent Pollet
Applicant Address: US MA Burlington
Assignee: Nuance Communications, Inc.
Current Assignee: Nuance Communications, Inc.
Current Assignee Address: US MA Burlington
Agency: Hamilton, Brook, Smith & Reynolds, P.C.
Main IPC: G10L13/06
IPC: G10L13/06 ; G10L13/04 ; G10L19/00 ; G10L25/48 ; G10L25/18

System and method for automatic prediction of speech suitability for statistical modeling

Abstract:

An embodiment according to the invention provides a capability of automatically predicting how favorable a given speech signal is for statistical modeling, which is advantageous in a variety of different contexts. In Multi-Form Segment (MFS) synthesis, for example, an embodiment according to the invention uses prediction capability to provide an automatic acoustic driven template versus model decision maker with an output quality that is high, stable and depends gradually on the system footprint. In speaker selection for a statistical Text-to-Speech synthesis (TTS) system build, as another example context, an embodiment according to the invention enables a fast selection of the most appropriate speaker among several available ones for the full voice dataset recording and preparation, based on a small amount of recorded speech material.

Abstract(Chinese):

根据本发明的实施例提供了一种自动预测给定语音信号对于统计建模有利的能力，这在各种不同的上下文中是有利的。在多格段（MFS）合成中，例如，根据本发明的实施例使用预测能力来提供具有高，稳定的输出质量的自动声驱动模板与模型决策者，并逐渐依赖于系统占用。在用于统计文本到语音合成（TTS）系统构建的说话者选择中，作为另一示例性上下文，根据本发明的实施例使得能够在完整语音数据集记录和准备中的几个可用的扬声器中快速选择最合适的说话者，基于少量的录音材料。

Public/Granted literature

US20140074468A1 System and Method for Automatic Prediction of Speech Suitability for Statistical Modeling Public/Granted day:2014-03-13

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/06	.语音合成设备中使用的基本语音单元；级联规则