Deployed end-to-end speech recognition

Invention Grant

US10319374B2 Deployed end-to-end speech recognition (Active)

Patent Title: Deployed end-to-end speech recognition
Application No.: US15358083

Application Date: 2016-11-21
Publication No.: US10319374B2

Publication Date: 2019-06-11
Inventor: Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Erich Elsen, Jesse Engel, Christopher Fougner, Xu Han, Awni Hannun, Ryan Prenger, Sanjeev Satheesh, Shubhabrata Sengupta, Dani Yogatama, Chong Wang, Jun Zhan, Zhenyao Zhu, Dario Amodei
Applicant: Baidu USA, LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA, LLC
Current Assignee: Baidu USA, LLC
Current Assignee Address: US CA Sunnyvale
Agency: North Weber & Baugh LLP
Main IPC: G10L15/02
IPC: G10L15/02; G10L15/06; G10L15/14; G10L15/16; G10L15/197; G10L25/21; G10L25/18; G06N3/04; G06N3/08; G10L15/183

Abstract:

Embodiments of end-to-end deep learning systems and methods are disclosed to recognize speech of vastly different languages, such as English or Mandarin Chinese. In embodiments, the entire pipelines of hand-engineered components are replaced with neural networks, and the end-to-end learning allows handling a diverse variety of speech including noisy environments, accents, and different languages. Using a trained embodiment and an embodiment of a batch dispatch technique with GPUs in a data center, an end-to-end deep learning system can be inexpensively deployed in an online setting, delivering low latency when serving users at scale.

Public/Granted literature

US20170148433A1 DEPLOYED END-TO-END SPEECH RECOGNITION Public/Granted day: 2017-05-25

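IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/02	.Feature extraction for speech recognition; selection of recognition unit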