Invention Grant
- Patent Title: Method and apparatus for open-vocabulary end-to-end speech recognition
-
Application No.: US15843055Application Date: 2017-12-15
-
Publication No.: US10672388B2Publication Date: 2020-06-02
- Inventor: Takaaki Hori , Shinji Watanabe , John Hershey
- Applicant: Mitsubishi Electric Research Laboratories, Inc.
- Applicant Address: US MA Cambridge
- Assignee: Mitsubishi Electric Research Laboratories, Inc.
- Current Assignee: Mitsubishi Electric Research Laboratories, Inc.
- Current Assignee Address: US MA Cambridge
- Agent Gennadiy Vinokur; James McAleenan; Hironori Tsukamoto
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G10L15/19 ; G10L15/183 ; G10L15/02 ; G10L15/187 ; G10L15/22

Abstract:
A speech recognition system includes an input device to receive voice sounds, one or more processors, and one or more storage devices storing parameters and program modules including instructions which cause the one or more processors to perform operations. The operations include extracting an acoustic feature sequence from audio waveform data converted from the voice sounds, encoding the acoustic feature sequence into a hidden vector sequence using an encoder network having encoder network parameters, predicting first output label sequence probabilities by feeding the hidden vector sequence to a decoder network having decoder network parameters, predicting second output level sequence probabilities by a hybrid network using character-base language models (LMs) and word-level LMs; and searching, using a label sequence search module, for an output label sequence having a highest sequence probability by combining the first and second output label sequence probabilities provided from the decoder network and the hybrid network.
Public/Granted literature
- US20190189115A1 Method and Apparatus for Open-Vocabulary End-to-End Speech Recognition Public/Granted day:2019-06-20
Information query