Text-to-speech with emotional content

Invention Grant

US09824681B2 Text-to-speech with emotional content 有权

Please log in to see more content

Patent Title: Text-to-speech with emotional content
Application No.: US14483153

Application Date: 2014-09-11
Publication No.: US09824681B2

Publication Date: 2017-11-21
Inventor: Jian Luan , Lei He , Max Leung
Applicant: Microsoft Corporation
Applicant Address: US WA Redmond
Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Current Assignee Address: US WA Redmond
Agency: Law Offices of Richard Chi
Agent Richard Chi
Main IPC: G10L13/00
IPC: G10L13/00 ; G10L13/08 ; G10L13/027 ; G10L13/033

Abstract:

Techniques for converting text to speech having emotional content. In an aspect, an emotionally neutral acoustic trajectory is predicted for a script using a neutral model, and an emotion-specific acoustic trajectory adjustment is independently predicted using an emotion-specific model. The neutral trajectory and emotion-specific adjustments are combined to generate a transformed speech output having emotional content. In another aspect, state parameters of a statistical parametric model for neutral voice are transformed by emotion-specific factors that vary across contexts and states. The emotion-dependent adjustment factors may be clustered and stored using an emotion-specific decision tree or other clustering scheme distinct from a decision tree used for the neutral voice model.

Public/Granted literature

US20160078859A1 TEXT-TO-SPEECH WITH EMOTIONAL CONTENT Public/Granted day:2016-03-17

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统