Scalable entities and patterns mining pipeline to improve automatic speech recognition

Invention Grant

US12087286B2 Scalable entities and patterns mining pipeline to improve automatic speech recognition 有权

Please log in to see more content

Patent Title: Scalable entities and patterns mining pipeline to improve automatic speech recognition
Application No.: US17313146

Application Date: 2021-05-06
Publication No.: US12087286B2

Publication Date: 2024-09-10
Inventor: Ankur Gupta , Satarupa Guha , Rupeshkumar Rasiklal Mehta , Issac John Alphonso , Anastasios Anastasakos , Shuangyu Chang
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
Current Assignee Address: US WA Redmond
Agency: CALFEE, HALTER & GRISWOLD LLP
Main IPC: G10L15/18
IPC: G10L15/18 ; G06F16/332 ; G10L15/22 ; G06N5/022 ; G10L15/08

Scalable entities and patterns mining pipeline to improve automatic speech recognition

Abstract:

A computing system obtains features that have been extracted from an acoustic signal, where the acoustic signal comprises spoken words uttered by a user. The computing system performs automatic speech recognition (ASR) based upon the features and a language model (LM) generated based upon expanded pattern data. The expanded pattern data includes a name of an entity and a search term, where the entity belongs to a segment identified in a knowledge base. The search term has been included in queries for entities belonging to the segment. The computing system identifies a sequence of words corresponding to the features based upon results of the ASR. The computing system transmits computer-readable text to a search engine, where the text includes the sequence of words.

Public/Granted literature

US20220358910A1 SCALABLE ENTITIES AND PATTERNS MINING PIPELINE TO IMPROVE AUTOMATIC SPEECH RECOGNITION Public/Granted day:2022-11-10

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/18	..利用自然语言模型