Proper noun recognition in end-to-end speech recognition

Invention Grant

US11749259B2 Proper noun recognition in end-to-end speech recognition 有权

Please log in to see more content

Patent Title: Proper noun recognition in end-to-end speech recognition
Application No.: US17150491

Application Date: 2021-01-15
Publication No.: US11749259B2

Publication Date: 2023-09-05
Inventor: Charles Caleb Peyser , Tara N. Sainath , Golan Pundak
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Honigman LLP
Agent Brett A. Krueger; Grant J. Griffith
Main IPC: G10L15/06
IPC: G10L15/06 ; G10L15/16 ; G10L15/18 ; G10L15/187 ; G06N3/049

Proper noun recognition in end-to-end speech recognition

Abstract:

A method for training a speech recognition model with a minimum word error rate loss function includes receiving a training example comprising a proper noun and generating a plurality of hypotheses corresponding to the training example. Each hypothesis of the plurality of hypotheses represents the proper noun and includes a corresponding probability that indicates a likelihood that the hypothesis represents the proper noun. The method also includes determining that the corresponding probability associated with one of the plurality of hypotheses satisfies a penalty criteria. The penalty criteria indicating that the corresponding probability satisfies a probability threshold, and the associated hypothesis incorrectly represents the proper noun. The method also includes applying a penalty to the minimum word error rate loss function.

Public/Granted literature

US20210233512A1 Proper Noun Recognition in End-to-End Speech Recognition Public/Granted day:2021-07-29

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）