Efficiency adjustable speech recognition system

Invention Grant

US11715462B2 Efficiency adjustable speech recognition system 有权

Please log in to see more content

Patent Title: Efficiency adjustable speech recognition system
Application No.: US17244891

Application Date: 2021-04-29
Publication No.: US11715462B2

Publication Date: 2023-08-01
Inventor: Yu Wu , Jinyu Li , Shujie Liu , Xie Chen , Chengyi Wang
Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Workman Nydegger
Main IPC: G10L15/16
IPC: G10L15/16 ; G06N3/08 ; G10L15/06 ; G10L15/22 ; G06N3/044

Efficiency adjustable speech recognition system

Abstract:

A computing system is configured to generate a transformer-transducer-based deep neural network. The transformer-transducer-based deep neural network comprises a transformer encoder network and a transducer predictor network. The transformer encoder network has a plurality of layers, each of which includes a multi-head attention network sublayer and a feed-forward network sublayer. The computing system trains an end-to-end (E2E) automatic speech recognition (ASR) model, using the transformer-transducer-based deep neural network. The E2E ASR model has one or more adjustable hyperparameters that are configured to dynamically adjust an efficiency or a performance of E2E ASR model when the E2E ASR model is deployed onto a device or executed by the device.

Public/Granted literature

US20220351718A1 EFFICIENCY ADJUSTABLE SPEECH RECOGNITION SYSTEM Public/Granted day:2022-11-03

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络