System and method for building a voice database

Invention Grant

US10861476B2 System and method for building a voice database 有权

Please log in to see more content

Patent Title: System and method for building a voice database
Application No.: US15989065

Application Date: 2018-05-24
Publication No.: US10861476B2

Publication Date: 2020-12-08
Inventor: William Carter Huffman , Michael Pappas
Applicant: Modulate, Inc.
Applicant Address: US MA Cambridge
Assignee: Modulate, Inc.
Current Assignee: Modulate, Inc.
Current Assignee Address: US MA Cambridge
Agency: Nutter McClennen & Fish LLP
Main IPC: G10L21/00
IPC: G10L21/00 ; G10L21/013 ; G10L15/02 ; G10L15/22 ; G10L15/06 ; G10L19/018 ; G10L25/30

System and method for building a voice database

Abstract:

A timbre vector space construction system for building a timbre vector space has an input. The input is configured to receive a first speech segment in a first voice and a second speech segment in a second voice. The system also includes a temporal receptive field to transform the first speech segment into a first plurality of analytical segments, and the second speech segment into a second plurality of analytical segments. Each of the first plurality of smaller analytical segments, and each of the second plurality of analytical segments have a frequency distribution that represents a different portion of the timbre data of the respective voices. The system also includes a machine learning system configured to map the first voice relative to the second voice in the timbre vector space as a function of the frequency distribution of the first plurality of analytical segments the second plurality of analytical segments.

Public/Granted literature

US20180342257A1 System and Method for Building a Voice Database Public/Granted day:2018-11-29

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L21/00	为了改变语音或声音信号的质量或其可识度而处理语音或声音信号，以产生另一种可听的或非可听的信号，例如视觉信号或触觉信号（G10L19/00优先）