Method and apparatus of training acoustic feature extracting model, device and computer storage medium

Invention Grant

US10943582B2 Method and apparatus of training acoustic feature extracting model, device and computer storage medium (in force)

Patent Title: Method and apparatus of training acoustic feature extracting model, device and computer storage medium
Application No.: US15979018

Application Date: 2018-05-14
Publication No.: US10943582B2

Publication Date: 2021-03-09
Inventor: Bing Jiang, Xiaokong Ma, Chao Li, Xiangang Li
Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Applicant Address: CN Beijing
Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee Address: CN Beijing
Agency: Brooks Kushman PC
Priority: CN201710359207.1 20170519
Main IPC: G10L15/16
IPC: G10L15/16; G06N3/04; G10L15/02; G10L15/06; G06N3/08

Abstract:

A method and apparatus for training an acoustic feature extracting model, a device and a computer storage medium. The method comprises: using, as training data, first acoustic features extracted respectively from speech data corresponding to user identifiers; training a deep-neural-network-based initial model under a minimum classification error criterion until a preset first stop condition is reached; replacing the Softmax layer in the initial model with a triplet loss layer to constitute the acoustic feature extracting model, and continuing to train the acoustic feature extracting model until a preset second stop condition is reached, the acoustic feature extracting model being used to output a second acoustic feature of the speech data; wherein the triplet loss layer is used to maximize similarity between the second acoustic features of the same user and minimize similarity between the second acoustic features of different users.
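
The abstract does not state the loss formula, so the following is only a minimal sketch of how a triplet loss over second acoustic features might look. It assumes cosine similarity as the similarity measure and a hinge with a fixed margin; the function names, the margin value of 0.2, and the plain-list feature representation are all hypothetical choices for illustration, not taken from the patent.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge-style triplet loss on similarities (margin is an assumed hyperparameter).

    anchor and positive are features from the same user; negative is a
    feature from a different user. The loss is zero once the same-user
    similarity exceeds the different-user similarity by at least `margin`.
    """
    sim_same = cosine_similarity(anchor, positive)
    sim_diff = cosine_similarity(anchor, negative)
    return max(0.0, sim_diff - sim_same + margin)


# Well-separated triplet: same-user features align, different-user feature is orthogonal.
easy = triplet_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])

# Violating triplet: the different-user feature is closer to the anchor than the same-user one.
hard = triplet_loss([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
```

Minimizing a loss of this shape pushes same-user features together and different-user features apart, which matches the stated purpose of the triplet loss layer. In the two-stage scheme described above, such a loss would only be applied after the Softmax-based classification training has reached its first stop condition.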

Public/Granted literature

US20180336888A1 Method and Apparatus of Training Acoustic Feature Extracting Model, Device and Computer Storage Medium. Public/Granted day: 2018-11-22

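IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	. Speech classification or search
G10L15/16	.. Using artificial neural networks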