Mixing heterogeneous loss types to improve accuracy of keyword spotting

Invention Grant

US12125476B2 Mixing heterogeneous loss types to improve accuracy of keyword spotting 有权

Please log in to see more content

Patent Title: Mixing heterogeneous loss types to improve accuracy of keyword spotting
Application No.: US17652801

Application Date: 2022-02-28
Publication No.: US12125476B2

Publication Date: 2024-10-22
Inventor: Hyun Jin Park , Alex Seungryong Park , Ignacio Lopez Moreno
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Honigman LLP
Agent Brett A. Krueger; Grant Griffith
Main IPC: G10L15/16
IPC: G10L15/16 ; G06N3/08 ; G10L15/02 ; G10L15/06 ; G10L15/22 ; G06N3/0455 ; G10L15/08

Mixing heterogeneous loss types to improve accuracy of keyword spotting

Abstract:

A method for training a neural network includes receiving a training input audio sequence including a sequence of input frames defining a hotword that initiates a wake-up process on a user device. The method further includes obtaining a first label and a second label for the training input audio sequence. The method includes generating, using a memorized neural network and the training input audio sequence, an output indicating a likelihood the training input audio sequence includes the hotword. The method further includes determining a first loss based on the first label and the output. The method includes determining a second loss based on the second label and the output. The method further includes optimizing the memorized neural network based on the first loss and the second loss associated with the training input audio sequence.

Public/Granted literature

US20230274731A1 Mixing Heterogeneous Loss Types to Improve Accuracy of Keyword Spotting Public/Granted day:2023-08-31

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络