Method and apparatus for multi-lingual end-to-end speech recognition

Invention Grant

US10593321B2 Method and apparatus for multi-lingual end-to-end speech recognition 有权

Please log in to see more content

Patent Title: Method and apparatus for multi-lingual end-to-end speech recognition
Application No.: US15843047

Application Date: 2017-12-15
Publication No.: US10593321B2

Publication Date: 2020-03-17
Inventor: Shinji Watanabe , Takaaki Hori , Hiroshi Seki , Jonathan Le Roux , John Hershey
Applicant: Mitsubishi Electric Research Laboratories, Inc.
Applicant Address: US MA Cambridge
Assignee: Mitsubishi Electric Research Laboratories, Inc.
Current Assignee: Mitsubishi Electric Research Laboratories, Inc.
Current Assignee Address: US MA Cambridge
Agent Gennadiy Vinokur; James McAleenan; Hironori Tsukamoto
Main IPC: G10L15/06
IPC: G10L15/06 ; G06N3/08 ; G06N7/00 ; G06N3/04 ; G10L15/02 ; G10L15/16 ; G10L15/197 ; G10L15/22 ; G10L15/00

Method and apparatus for multi-lingual end-to-end speech recognition

Abstract:

A method for training a multi-language speech recognition network includes providing utterance datasets corresponding to predetermined languages, inserting language identification (ID) labels into the utterance datasets, wherein each of the utterance datasets is labelled by each of the language ID labels, concatenating the labeled utterance datasets, generating initial network parameters from the utterance datasets, selecting the initial network parameters according to a predetermined sequence, and training, iteratively, an end-to-end network with a series of the selected initial network parameters and the concatenated labeled utterance datasets until a training result reaches a threshold.

Public/Granted literature

US20190189111A1 Method and Apparatus for Multi-Lingual End-to-End Speech Recognition Public/Granted day:2019-06-20

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）