Noisy student teacher training for robust keyword spotting

Invention Grant

US12027162B2 Noisy student teacher training for robust keyword spotting 有权

Please log in to see more content

Patent Title: Noisy student teacher training for robust keyword spotting
Application No.: US17190779

Application Date: 2021-03-03
Publication No.: US12027162B2

Publication Date: 2024-07-02
Inventor: Hyun Jin Park , Pai Zhu , Ignacio Lopez Moreno , Niranjan Subrahmanya
Applicant: GOOGLE LLC
Applicant Address: US CA Mountain View
Assignee: GOOGLE LLC
Current Assignee: GOOGLE LLC
Current Assignee Address: US CA Mountain View
Agency: Gray Ice Higdon
Main IPC: G10L15/22
IPC: G10L15/22 ; G06F18/24 ; G10L15/06 ; G10L15/08 ; G10L21/0208

Noisy student teacher training for robust keyword spotting

Abstract:

Teacher-student learning can be used to train a keyword spotting (KWS) model using augmented training instance(s). Various implementations include aggressively augmenting (e.g., using spectral augmentation) base audio data to generate augmented audio data, where one or more portions of the base instance of audio data can be masked in the augmented instance of audio data (e.g., one or more time frames can be masked, one or more frequencies can be masked, etc.). Many implementations include processing augmented audio data using a KWS teacher model to generate a soft label, and processing the augmented audio data using a KWS student model to generate predicted output. One or more portions of the KWS student model can be updated based on a comparison of the soft label and the generated predicted output.

Public/Granted literature

US20220284891A1 NOISY STUDENT TEACHER TRAINING FOR ROBUST KEYWORD SPOTTING Public/Granted day:2022-09-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/22	.在语音识别过程中（例如在人机对话过程中）使用的程序