Speech waveform generation

Invention Grant

US11869482B2 Speech waveform generation (status: in force)

Patent Title: Speech waveform generation
Application No.: US17272325

Application Date: 2018-09-30
Publication No.: US11869482B2

Publication Date: 2024-01-09
Inventor: Yang Cui, Xi Wang, Lei He, Kao-Ping Soong
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Schwegman Lundberg & Woessner, P.A.
International Application: PCT/CN2018/109044 (2018-09-30)
International Publication: WO2020/062217A (2020-04-02)
National phase entry date: 2021-02-28
Main IPC: G10L13/047
IPC: G10L13/047

Abstract:

A method and apparatus for generating a speech waveform. Fundamental frequency information, glottal features and vocal tract features associated with an input may be received, wherein the glottal features include a phase feature, a shape feature, and an energy feature (1310). A glottal waveform is generated based on the fundamental frequency information and the glottal features through a first neural network model (1320). A speech waveform is generated based on the glottal waveform and the vocal tract features through a second neural network model (1330).
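The two-stage data flow described in the abstract can be sketched as follows. This is a minimal illustration only: the two "models" are simple placeholder functions standing in for the first and second neural network models, and the names `GlottalFeatures`, `toy_glottal_model`, `toy_vocal_tract_model`, `generate_speech`, `SAMPLE_RATE`, and `FRAME_LEN`, as well as all numeric values, are assumptions that do not come from the patent.

```python
import math
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class GlottalFeatures:
    # The three glottal features named in the abstract, one value per frame.
    phase: List[float]
    shape: List[float]   # must be positive in this toy sketch
    energy: List[float]


# Hypothetical settings, not taken from the patent.
SAMPLE_RATE = 16000
FRAME_LEN = 80  # samples per frame (5 ms at 16 kHz)


def toy_glottal_model(f0: List[float], g: GlottalFeatures) -> List[float]:
    """Placeholder for the first neural network model:
    (F0, glottal features) -> glottal waveform."""
    wave = []
    for i, hz in enumerate(f0):
        for n in range(FRAME_LEN):
            s = math.sin(2 * math.pi * hz * n / SAMPLE_RATE + g.phase[i])
            # "shape" skews the pulse and "energy" scales it; purely illustrative.
            wave.append(g.energy[i] * math.copysign(abs(s) ** g.shape[i], s))
    return wave


def toy_vocal_tract_model(glottal: List[float], vt: List[float]) -> List[float]:
    """Placeholder for the second neural network model:
    (glottal waveform, vocal tract features) -> speech waveform."""
    out, prev = [], 0.0
    for n, x in enumerate(glottal):
        a = vt[n // FRAME_LEN]  # one filter coefficient per frame, |a| < 1
        prev = x + a * prev     # one-pole filter as a crude vocal tract stand-in
        out.append(prev)
    return out


def generate_speech(
    f0: List[float],
    glottal_features: GlottalFeatures,
    vocal_tract: List[float],
    first_model: Callable = toy_glottal_model,
    second_model: Callable = toy_vocal_tract_model,
) -> List[float]:
    # Stage 1: glottal waveform from F0 and glottal features.
    glottal_waveform = first_model(f0, glottal_features)
    # Stage 2: speech waveform from the glottal waveform and vocal tract features.
    return second_model(glottal_waveform, vocal_tract)


f0 = [120.0, 125.0]
feats = GlottalFeatures(phase=[0.0, 0.0], shape=[1.0, 1.0], energy=[0.5, 0.5])
speech = generate_speech(f0, feats, vocal_tract=[0.5, 0.5])
print(len(speech))  # 160
```

In a real system, each placeholder would be replaced by a trained network; the sketch only shows which inputs feed which stage.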

Published/granted documents

US20210193112A1 SPEECH WAVEFORM GENERATION Publication/grant date: 2021-06-24

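IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers
G10L13/04	..Details of speech synthesis systems, e.g. synthesiser structure or memory management
G10L13/047	...Architecture of speech synthesisers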