Invention Grant
- Patent Title: Deep neural network based non-autoregressive speech synthesizer method and system using multiple decoder
-
Application No.: US17599793Application Date: 2020-06-26
-
Publication No.: US12033613B2Publication Date: 2024-07-09
- Inventor: Joon-Hyuk Chang , Moa Lee
- Applicant: IUCF-HYU (INDUSTRY-UNIVERSITY COOPERATION FOUNDATION HANYANG UNIVERSITY)
- Applicant Address: KR Seoul
- Assignee: IUCF-HYU (INDUSTRY-UNIVERSITY COOPERATION FOUNDATION HANYANG UNIVERSITY)
- Current Assignee: IUCF-HYU (INDUSTRY-UNIVERSITY COOPERATION FOUNDATION HANYANG UNIVERSITY)
- Current Assignee Address: KR Seoul
- Agency: Sughrue Mion, PLLC
- Priority: KR 20190085920 2019.07.16
- International Application: PCT/KR2020/008330 2020.06.26
- International Announcement: WO2021/010613A 2021.01.21
- Date entered country: 2021-09-29
- Main IPC: G10L13/047
- IPC: G10L13/047 ; G06N3/08 ; G10L25/30

Abstract:
Proposed are a deep neural network-based non-autoregressive voice synthesizing method and a system therefor. A deep neural network-based non-autoregressive voice synthesizing system according to an embodiment may comprise: a voice feature vector column synthesizing unit which constitutes a non-recursive deep neural network based on multiple decoders, and gradually produces a voice feature vector column through the multiple decoders from a template including temporal information of a voice; and a voice reconstituting unit which transforms the voice feature vector column into voice data, wherein the voice feature vector column synthesizing unit produces a template input, and produces a voice feature vector column by adding, to the template input, sentence data refined through an attention mechanism.
Public/Granted literature
- US20220108681A1 DEEP NEURAL NETWORK BASED NON-AUTOREGRESSIVE SPEECH SYNTHESIZER METHOD AND SYSTEM USING MULTIPLE DECODER Public/Granted day:2022-04-07
Information query