Data augmentation method based on stochastic feature mapping for automatic speech recognition

Invention Grant

US09824683B2 Data augmentation method based on stochastic feature mapping for automatic speech recognition 有权

Please log in to see more content

Patent Title: Data augmentation method based on stochastic feature mapping for automatic speech recognition
Application No.: US14977674

Application Date: 2015-12-22
Publication No.: US09824683B2

Publication Date: 2017-11-21
Inventor: Xiaodong Cui , Vaibhava Goel , Brian E. D. Kingsbury
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agency: Otterstedt, Ellenbogen & Kammer, LLP
Agent David M. Quinn
Main IPC: G10L15/00
IPC: G10L15/00 ; G10L15/06 ; G10L21/0272 ; G10L15/16 ; G10L15/02

Data augmentation method based on stochastic feature mapping for automatic speech recognition

Abstract:

A method of augmenting training data includes converting a feature sequence of a source speaker determined from a plurality of utterances within a transcript to a feature sequence of a target speaker under the same transcript, training a speaker-dependent acoustic model for the target speaker for corresponding speaker-specific acoustic characteristics, estimating a mapping function between the feature sequence of the source speaker and the speaker-dependent acoustic model of the target speaker, and mapping each utterance from each speaker in a training set using the mapping function to multiple selected target speakers in the training set.

Public/Granted literature

US20170200446A1 DATA AUGMENTATION METHOD BASED ON STOCHASTIC FEATURE MAPPING FOR AUTOMATIC SPEECH RECOGNITION Public/Granted day:2017-07-13

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）