Invention Grant
- Patent Title: Adaptive beam pruning for automatic speech recognition
-
Application No.: US15196184Application Date: 2016-06-29
-
Publication No.: US10199037B1Publication Date: 2019-02-05
- Inventor: Denis Sergeyevich Filimonov , Yuan Shangguan
- Applicant: AMAZON TECHNOLOGIES, INC.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Pierce Atwood LLP
- Main IPC: G10L15/02
- IPC: G10L15/02 ; G10L15/06 ; G10L15/08 ; G10L15/28

Abstract:
A reduced latency system for automatic speech recognition (ASR). The system can use certain feature values describing the state of ASR processing to estimate how far a lowest scoring node for an audio frame is from a potential node likely be part of the Viterbi path. The system can then adjust its beam width in a manner likely to encompass the node likely to be on the Viterbi path, thus pruning unnecessary nodes and reducing latency. The feature values and estimated distances may be based on a set of training data, where the system identifies specific nodes on the Viterbi path and determines what feature values correspond to what desired beam widths. Trained models or other data may be created at training and used at runtime to dynamically adjust the beam width, as well as other settings such as threshold number of active nodes.
Information query