Invention Grant
- Patent Title: Knowledge transfer in permutation invariant training for single-channel multi-talker speech recognition
- Application No.: US15940197
- Application Date: 2018-03-29
- Publication No.: US10699697B2
- Publication Date: 2020-06-30
- Inventor: Yanmin Qian, Dong Yu
- Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Applicant Address: CN Shenzhen
- Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- Current Assignee Address: CN Shenzhen
- Agency: Sughrue Mion, PLLC
- Main IPC: G10L15/00
- IPC: G10L15/00; G10L15/06

Abstract:
Provided are a speech recognition training processing method and an apparatus including the same. The speech recognition training processing method includes acquiring a multi-talker mixed speech signal from a plurality of speakers, performing permutation invariant training (PIT) model training on the multi-talker mixed speech signal based on knowledge from a single-talker speech recognition model, and updating a multi-talker speech recognition model based on a result of the PIT model training.
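The abstract does not spell out the training objective, so the following is only a minimal sketch of the general idea, not the patented method. It assumes the "knowledge" from the single-talker model takes the form of per-frame posterior distributions (soft labels) for each speaker, and that the multi-talker model is scored with cross-entropy under the best assignment of output streams to speakers. The function names `cross_entropy` and `pit_loss` and the toy data are hypothetical.

```python
import itertools
import math


def cross_entropy(target_frames, predicted_frames, eps=1e-12):
    """Mean per-frame cross-entropy between two sequences of distributions."""
    total = 0.0
    for target, predicted in zip(target_frames, predicted_frames):
        total -= sum(t * math.log(p + eps) for t, p in zip(target, predicted))
    return total / len(target_frames)


def pit_loss(teacher_streams, student_streams):
    """Return (minimum loss, best permutation) over all stream assignments.

    teacher_streams[k]: soft labels for speaker k (assumed to come from a
        single-talker model run on that speaker's clean speech).
    student_streams[s]: output stream s of the multi-talker model.
    best permutation p means student stream s is matched to teacher p[s].
    """
    n = len(student_streams)
    best_loss, best_perm = float("inf"), None
    for perm in itertools.permutations(range(n)):
        loss = sum(
            cross_entropy(teacher_streams[perm[s]], student_streams[s])
            for s in range(n)
        ) / n
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm


# Toy example: two speakers, two frames, two output classes.
teacher = [
    [[0.9, 0.1], [0.8, 0.2]],  # speaker 0 soft labels
    [[0.1, 0.9], [0.2, 0.8]],  # speaker 1 soft labels
]
student = [
    [[0.2, 0.8], [0.1, 0.9]],  # output stream 0 (resembles speaker 1)
    [[0.7, 0.3], [0.9, 0.1]],  # output stream 1 (resembles speaker 0)
]
loss, perm = pit_loss(teacher, student)
```

In an actual training loop the minimum-loss assignment would be used to compute gradients and update the multi-talker model; enumerating all permutations is practical only for a small number of speakers.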
Public/Granted literature
- US20190304437A1 KNOWLEDGE TRANSFER IN PERMUTATION INVARIANT TRAINING FOR SINGLE-CHANNEL MULTI-TALKER SPEECH RECOGNITION, Public/Granted day: 2019-10-03