Invention Grant
- Patent Title: Transformer-based automatic speech recognition system incorporating time-reduction layer
- Application No.: US17076794
- Application Date: 2020-10-21
- Publication No.: US11715461B2
- Publication Date: 2023-08-01
- Inventor: Md Akmal Haidar, Chao Xing
- Applicant: Md Akmal Haidar, Chao Xing
- Applicant Address: CA Montreal
- Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee: HUAWEI TECHNOLOGIES CO., LTD.
- Current Assignee Address: CN Shenzhen
- Main IPC: G10L15/16
- IPC: G10L15/16; G10L15/06

Abstract:
A computer-implemented method and system for automatic speech recognition. A first speech sequence, comprising a first set of speech frame feature vectors, is processed using a time reduction operation of an encoder neural network (NN) into a second speech sequence. The second speech sequence comprises a second set of speech frame feature vectors, each of which concatenates information from a respective plurality of speech frame feature vectors in the first set, and it contains fewer speech frame feature vectors than the first speech sequence. The second speech sequence is transformed, using a self-attention operation of the encoder NN, into a third speech sequence comprising a third set of speech frame feature vectors. The third speech sequence is then processed in two ways: using a probability operation of the encoder NN to predict a sequence of first labels corresponding to the third set of speech frame feature vectors, and using a decoder NN to predict a sequence of second labels corresponding to the third set of speech frame feature vectors.
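The time reduction operation described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the function name `time_reduce`, the reduction factor of 2, and the zero-padding of a trailing partial group are all assumptions made for illustration. The sketch only shows the general idea of concatenating groups of consecutive frame feature vectors so that the output sequence has fewer, wider frames.

```python
from typing import List

Frame = List[float]


def time_reduce(frames: List[Frame], factor: int = 2) -> List[Frame]:
    """Concatenate each group of `factor` consecutive frames into one frame.

    Hypothetical helper: the name, the default factor, and the padding
    strategy are illustrative assumptions, not taken from the patent.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    if not frames:
        return []

    dim = len(frames[0])
    # Assumption: zero-pad the tail so the frame count divides evenly.
    pad = (-len(frames)) % factor
    padded = frames + [[0.0] * dim for _ in range(pad)]

    reduced: List[Frame] = []
    for start in range(0, len(padded), factor):
        merged: Frame = []
        # Concatenate the feature vectors of `factor` neighbouring frames.
        for frame in padded[start:start + factor]:
            merged.extend(frame)
        reduced.append(merged)
    return reduced


# First speech sequence: 5 frames, each a 2-dimensional feature vector.
first_sequence = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0], [6.0, 7.0], [8.0, 9.0]]
# Second speech sequence: fewer frames, each of dimension 2 * factor.
second_sequence = time_reduce(first_sequence, factor=2)
```

With these toy values, five 2-dimensional frames become three 4-dimensional frames, so the self-attention operation that follows would run over a shorter sequence.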
Public/Granted literature
- US20220122590A1 TRANSFORMER-BASED AUTOMATIC SPEECH RECOGNITION SYSTEM INCORPORATING TIME-REDUCTION LAYER Public/Granted day: 2022-04-21