Invention Grant
- Patent Title: Training method of hybrid frequency acoustic recognition model, and speech recognition method
-
Application No.: US16487819Application Date: 2018-01-26
-
Publication No.: US11120789B2Publication Date: 2021-09-14
- Inventor: Lichun Fan
- Applicant: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
- Applicant Address: CN Hangzhou
- Assignee: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
- Current Assignee: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
- Current Assignee Address: CN Hangzhou
- Agency: Getech Law LLC
- Agent Jun Ye
- Priority: CN201710108893.5 20170227
- International Application: PCT/CN2018/074320 WO 20180126
- International Announcement: WO2018/153214 WO 20180830
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G10L15/02 ; G10L15/14 ; G10L15/16 ; G10L25/21 ; G10L25/24

Abstract:
The invention discloses a training method and a speech recognition method for a mixed frequency acoustic recognition model, which belongs to the technical field of speech recognition. The method comprises: obtaining a first-type speech feature of the first speech signal, and processing the first speech data to obtain corresponding first speech training data (S1); obtaining the first-type speech feature of the second speech signal, and processing the second speech data to obtain corresponding second speech training data (S2); obtaining a second-type speech feature of the first speech signal according to a power spectrum of the first speech signal, and obtaining the second-type speech feature of the second speech signal according to a power spectrum of the second speech signal (S3); performing pre-training according to the first speech signal and the second speech signal, so as to form a preliminary recognition model of the hybrid frequency acoustic recognition model (S4); and performing supervised parameter training on the preliminary recognition model according to the first speech training data, the second speech training data and the second-type speech feature, so as to form the hybrid frequency acoustic recognition model (S5). The beneficial effects of the above technical solution are: the recognition model has better robustness and generalization.
Public/Granted literature
- US20200380954A1 TRAINING METHOD OF HYBRID FREQUENCY ACOUSTIC RECOGNITION MODEL, AND SPEECH RECOGNITION METHOD Public/Granted day:2020-12-03
Information query