Split-model architecture for DNN-based small corpus voice conversion

Invention Grant

US10453476B1 Split-model architecture for DNN-based small corpus voice conversion (Active)

Patent Title: Split-model architecture for DNN-based small corpus voice conversion
Application No.: US15657003

Application Date: 2017-07-21
Publication No.: US10453476B1

Publication Date: 2019-10-22
Inventor: Sandesh Aryal
Applicant: Sandesh Aryal
Applicant Address: US CA Pasadena
Assignee: OBEN, INC.
Current Assignee: OBEN, INC.
Current Assignee Address: US CA Pasadena
Agent: Andrew S. Naglestad
Main IPC: G10L13/033
IPC: G10L13/033 ; G10L25/24 ; G10L25/30 ; G06N3/04 ; G10L21/013

Abstract:

A voice conversion system suitable for encoding small and large corpuses is disclosed. The voice conversion system comprises hardware including a neural network for generating estimated target speech data based on source speech data. The neural network includes an input layer, an output layer, and a novel split-model hidden layer. The input layer comprises a first portion and a second portion. The output layer comprises a third portion and a fourth portion. The hidden layer comprises a first subnet and a second subnet, wherein the first subnet is directly connected to the first portion of the input layer and the third portion of the output layer, and wherein the second subnet is directly connected to the second portion of the input layer and the fourth portion of the output layer. The first subnet and second subnet operate in parallel, and link to different but overlapping nodes of the input layer.
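
The wiring described in the abstract can be illustrated with a minimal forward-pass sketch. This is not the patented implementation: the class name `SplitModelNet`, the index ranges, the layer sizes, the tanh activation, the single hidden layer per subnet, and the random untrained weights are all assumptions chosen only to show two parallel subnets reading different but overlapping input nodes and writing to separate portions of the output.

```python
import math
import random


def make_dense(n_in, n_out, rng):
    """Create one fully connected layer as (weights, biases) with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(layer, x, activation=None):
    """Apply a fully connected layer to vector x, with an optional elementwise activation."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]
    return [activation(v) for v in out] if activation else out


class SplitModelNet:
    """Hypothetical split-model network: two subnets that run side by side.

    first_idx / second_idx: input-node indices seen by each subnet (may overlap).
    n_third / n_fourth: sizes of the output portions produced by each subnet.
    """

    def __init__(self, first_idx, second_idx, n_hidden, n_third, n_fourth, seed=0):
        rng = random.Random(seed)
        self.first_idx = list(first_idx)
        self.second_idx = list(second_idx)
        # Subnet 1: first input portion -> hidden -> third output portion.
        self.subnet1 = (make_dense(len(self.first_idx), n_hidden, rng),
                        make_dense(n_hidden, n_third, rng))
        # Subnet 2: second input portion -> hidden -> fourth output portion.
        self.subnet2 = (make_dense(len(self.second_idx), n_hidden, rng),
                        make_dense(n_hidden, n_fourth, rng))

    def forward(self, source_frame):
        """Map one source feature frame to an estimated target frame."""
        x1 = [source_frame[i] for i in self.first_idx]
        x2 = [source_frame[i] for i in self.second_idx]
        third = dense(self.subnet1[1], dense(self.subnet1[0], x1, math.tanh))
        fourth = dense(self.subnet2[1], dense(self.subnet2[0], x2, math.tanh))
        # The output layer is the concatenation of the two portions.
        return third + fourth


# Assumed toy dimensions: 10 input nodes, with nodes 4 and 5 shared by both subnets.
net = SplitModelNet(first_idx=range(0, 6), second_idx=range(4, 10),
                    n_hidden=8, n_third=5, n_fourth=5)
estimate = net.forward([0.1 * i for i in range(10)])
print(len(estimate))
```

Because the subnets share no hidden nodes in this sketch, a change to an input node that belongs only to the first portion can affect only the third portion of the output, and likewise for the second and fourth portions.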

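IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers
G10L13/033	..Voice editing, e.g. manipulating the voice of the synthesiser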