Invention Grant
- Patent Title: Method, apparatus, device and computer readable storage medium for recognizing and decoding voice based on streaming attention model
- Application No.: US16813271
- Application Date: 2020-03-09
- Publication No.: US11355113B2
- Publication Date: 2022-06-07
- Inventor: Junyao Shao, Sheng Qian, Lei Jia
- Applicant: Baidu Online Network Technology (Beijing) Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
- Current Assignee: Baidu Online Network Technology (Beijing) Co., Ltd.
- Current Assignee Address: CN Beijing
- Agency: Nixon Peabody LLP
- Priority: CN201910646762.1 20190717
- Main IPC: G10L15/22
- IPC: G10L15/22; G10L15/197; G10L15/02; G10L15/32

Abstract:
A method, apparatus, device, and computer readable storage medium for recognizing and decoding a voice based on a streaming attention model are provided. The method may include generating a plurality of acoustic paths for decoding the voice using the streaming attention model, and then merging those of the plurality of acoustic paths that have identical last syllables to obtain a plurality of merged acoustic paths. The method may further include selecting a preset number of acoustic paths from the plurality of merged acoustic paths as retained candidate acoustic paths. Embodiments of the present disclosure rest on the idea that the acoustic score of the current voice fragment is affected only by the immediately preceding voice fragment and is independent of earlier voice history; accordingly, candidate acoustic paths whose last syllables are identical are merged.
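
The merge-then-select step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the path representation (a non-empty tuple of syllables plus a cumulative log score), the function names `merge_by_last_syllable` and `select_candidates`, and the rule that merging keeps the highest-scoring path for each last syllable are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: (syllable sequence, cumulative log score).
# Each path is assumed to contain at least one syllable.
Path = Tuple[Tuple[str, ...], float]


def merge_by_last_syllable(paths: List[Path]) -> List[Path]:
    """Merge paths that end in the same syllable.

    Assumption: "merging" keeps only the best-scoring path per last syllable.
    """
    best: Dict[str, Path] = {}
    for syllables, score in paths:
        last = syllables[-1]
        if last not in best or score > best[last][1]:
            best[last] = (syllables, score)
    return list(best.values())


def select_candidates(paths: List[Path], preset_number: int) -> List[Path]:
    """Retain the `preset_number` highest-scoring merged paths."""
    return sorted(paths, key=lambda p: p[1], reverse=True)[:preset_number]


# Made-up candidate paths and scores, for illustration only.
candidates: List[Path] = [
    (("ni", "hao"), -1.2),
    (("li", "hao"), -2.0),  # same last syllable as the path above
    (("ni", "gao"), -1.5),
    (("ni", "hou"), -3.1),
]

merged = merge_by_last_syllable(candidates)
retained = select_candidates(merged, preset_number=2)
```

Under these assumptions the four candidates collapse to three merged paths (one each for "hao", "gao", and "hou"), and the two best of those are retained. Because the score of the current fragment is taken to depend only on the preceding fragment, discarding the lower-scoring path that shares a last syllable does not change later scores.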