Systems, apparatuses, and methods for speaker verification using artificial neural networks
Abstract:
In one aspect, instead of discriminatively training a single K-class ANN, the proposed architecture discriminatively trains K ANNs (e.g., the following K 2-class ANNs are trained: ANN_1, ANN_2, …, ANN_K). Each of these K 2-class ANNs learns to discriminate between audio material from one of the enrolled speakers and “average” speech material (e.g., a feature vector generated using a Universal Background Model trained as a Gaussian Mixture Model (GMM-UBM)). That is, for example, ANN_i is trained to discriminate between audio material from the ith enrolled speaker and the “average” speech material. In the event that a new speaker is to be enrolled in the system, an additional ANN (e.g., ANN_(K+1)) is trained with the available audio material (audio features) from that particular speaker and audio features produced by the GMM-UBM system.
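The enrollment scheme in the abstract can be sketched in code. The following is a minimal, hypothetical illustration (not the patented implementation): each "ANN" is reduced to a one-layer logistic-regression network trained speaker-vs-background, the GMM-UBM "average" speech features are simulated with random vectors, and all names, dimensions, and hyperparameters are assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 20  # feature-vector dimensionality (assumed for this sketch)

def train_binary_ann(speaker_feats, ubm_feats, epochs=200, lr=0.1):
    """Train one 2-class model: speaker audio (label 1) vs UBM 'average' speech (label 0)."""
    X = np.vstack([speaker_feats, ubm_feats])
    y = np.concatenate([np.ones(len(speaker_feats)), np.zeros(len(ubm_feats))])
    w, b = np.zeros(DIM), 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # sigmoid output
        grad = p - y                             # cross-entropy gradient
        w -= lr * X.T @ grad / len(y)
        b -= lr * grad.mean()
    return w, b

def score(model, feats):
    """Mean verification score of a batch of feature vectors under one model."""
    w, b = model
    return float(np.mean(1.0 / (1.0 + np.exp(-(feats @ w + b)))))

# Enroll K speakers: one binary ANN per speaker, all sharing the same UBM features.
ubm_feats = rng.normal(0.0, 1.0, size=(100, DIM))        # simulated "average" speech
speakers = {i: rng.normal(i + 1.0, 0.5, size=(50, DIM))  # synthetic enrolled speakers
            for i in range(3)}
models = {i: train_binary_ann(feats, ubm_feats) for i, feats in speakers.items()}

# Enrolling a new speaker only requires training one additional ANN (ANN_(K+1));
# the K existing models are untouched.
new_feats = rng.normal(-5.0, 0.5, size=(50, DIM))
models[len(models)] = train_binary_ann(new_feats, ubm_feats)

# Verification probe: a held-out batch from speaker 1 should score high under
# its own model and low under a dissimilar speaker's model.
probe = speakers[1][:10]
print(score(models[1], probe), score(models[3], probe))
```

The design point the abstract emphasizes is visible here: adding speaker K+1 trains one new 2-class model against the shared background features instead of retraining a single (K+1)-class network.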