Invention Grant
- Patent Title: Speech waveform generation
- Application No.: US17272325
- Application Date: 2018-09-30
- Publication No.: US11869482B2
- Publication Date: 2024-01-09
- Inventor: Yang Cui, Xi Wang, Lei He, Kao-Ping Soong
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Schwegman Lundberg & Woessner, P.A.
- International Application: PCT/CN2018/109044 (2018-09-30)
- International Publication: WO2020/062217A (2020-04-02)
- National Phase Entry Date: 2021-02-28
- Main IPC: G10L13/047
- IPC: G10L13/047

Abstract:
A method and apparatus for generating a speech waveform are provided. Fundamental frequency information, glottal features, and vocal tract features associated with an input may be received, wherein the glottal features include a phase feature, a shape feature, and an energy feature (1310). A glottal waveform is generated based on the fundamental frequency information and the glottal features through a first neural network model (1320). A speech waveform is generated based on the glottal waveform and the vocal tract features through a second neural network model (1330).
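
The data flow in the abstract can be illustrated with a minimal sketch. The abstract names only the inputs and the two stages, so everything else here is an assumption made for illustration: the frame-by-frame processing, the sizes `FRAME_LEN` and `VT_DIM`, and the helper `make_layer`, which builds a random tanh layer as a stand-in for a trained model. It is not the architecture claimed in the patent.

```python
import math
import random

FRAME_LEN = 8  # hypothetical number of waveform samples per frame
VT_DIM = 4     # hypothetical size of the vocal tract feature vector


def make_layer(in_dim, out_dim, seed):
    """Build a random tanh layer as a placeholder for a trained network."""
    r = random.Random(seed)
    weights = [[r.uniform(-0.5, 0.5) for _ in range(in_dim)]
               for _ in range(out_dim)]

    def forward(x):
        assert len(x) == in_dim
        return [math.tanh(sum(w * v for w, v in zip(row, x)))
                for row in weights]

    return forward


# First model: F0 plus (phase, shape, energy) -> glottal waveform frame.
glottal_net = make_layer(1 + 3, FRAME_LEN, seed=0)
# Second model: glottal waveform frame plus vocal tract features -> speech frame.
speech_net = make_layer(FRAME_LEN + VT_DIM, FRAME_LEN, seed=1)


def generate_speech(frames):
    """Run both stages per frame and concatenate the speech samples.

    Each frame is a dict with keys f0, phase, shape, energy (floats)
    and vocal_tract (list of VT_DIM floats).
    """
    samples = []
    for fr in frames:
        glottal_in = [fr["f0"], fr["phase"], fr["shape"], fr["energy"]]
        glottal_wave = glottal_net(glottal_in)                   # stage 1
        speech = speech_net(glottal_wave + fr["vocal_tract"])    # stage 2
        samples.extend(speech)
    return samples


frame = {"f0": 0.5, "phase": 0.1, "shape": 0.2, "energy": 0.3,
         "vocal_tract": [0.0] * VT_DIM}
waveform = generate_speech([frame] * 3)
```

The point of the sketch is the wiring: the vocal tract features bypass the first model and enter only at the second stage, alongside the generated glottal waveform.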
Published/Granted Literature
- US20210193112A1 SPEECH WAVEFORM GENERATION, Publication/Grant Date: 2021-06-24