Systems and methods for automatic speech recognition based on graphics processing units

Invention Grant

US11562734B2 Systems and methods for automatic speech recognition based on graphics processing units 有权

Please log in to see more content

Patent Title: Systems and methods for automatic speech recognition based on graphics processing units
Application No.: US17141200

Application Date: 2021-01-04
Publication No.: US11562734B2

Publication Date: 2023-01-24
Inventor: Yongxiong Ren , Yang Liu , Heng Liu , Lingzhi Liu , Jie Li , Kaituo Xu , Xiaorui Wang
Applicant: KWAI INC.
Applicant Address: US CA Palo Alto
Assignee: KWAI INC.
Current Assignee: KWAI INC.
Current Assignee Address: US CA Palo Alto
Agency: Arch & Lake LLP
Main IPC: G10L15/16
IPC: G10L15/16 ; G06F5/16 ; G10L25/30

Systems and methods for automatic speech recognition based on graphics processing units

Abstract:

The present disclosure relates to an automatic speech recognition system and a method thereof. The system includes a conformer encoder and a pair of ping-pong buffers. The encoder includes a plurality of encoder layers sequentially executed by one or more graphic processing units. At least one encoder layer includes a first feed forward module, a multi-head self-attention module, a convolution module, and a second feed forward module. The convolution module and the multi-head self-attention module are sandwiched between the first feedforward module and the second feed forward module. The four modules respectively include a plurality of encoder sublayers fused into one or more encoder kernels. The one or more encoder kernels respectively read from one of the pair of ping-pong buffers and write into the other of the pair of ping-pong buffers.

Public/Granted literature

US20220215832A1 SYSTEMS AND METHODS FOR AUTOMATIC SPEECH RECOGNITION BASED ON GRAPHICS PROCESSING UNITS Public/Granted day:2022-07-07

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/16	..利用人工神经网络