Condition-invariant feature extraction network

Invention Grant

US11217265B2 Condition-invariant feature extraction network 有权

Please log in to see more content

Patent Title: Condition-invariant feature extraction network
Application No.: US16434665

Application Date: 2019-06-07
Publication No.: US11217265B2

Publication Date: 2022-01-04
Inventor: Zhong Meng , Yong Zhao , Jinyu Li , Yifan Gong
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Buckley, Maschoff & Talwalkar LLC
Main IPC: G10L25/03
IPC: G10L25/03 ; G10L25/30 ; H04R29/00 ; G06N20/00 ; G06F17/18 ; G06N3/08 ; H04R5/04

Condition-invariant feature extraction network

Abstract:

To generate substantially condition-invariant and speaker-discriminative features, embodiments are associated with a feature extractor capable of extracting features from speech frames based on first parameters, a speaker classifier capable of identifying a speaker based on the features and on second parameters, and a condition classifier capable of identifying a noise condition based on the features and on third parameters. The first parameters of the feature extractor and the second parameters of the speaker classifier are trained to minimize a speaker classification loss, the first parameters of the feature extractor are further trained to maximize a condition classification loss, and the third parameters of the condition classifier are trained to minimize the condition classification loss.

Public/Granted literature

US20200335122A1 CONDITION-INVARIANT FEATURE EXTRACTION NETWORK Public/Granted day:2020-10-22

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/03	.以提取参数类型为特征的