Training method of hybrid frequency acoustic recognition model, and speech recognition method

Invention Grant

US11120789B2 Training method of hybrid frequency acoustic recognition model, and speech recognition method 有权

Please log in to see more content

Patent Title: Training method of hybrid frequency acoustic recognition model, and speech recognition method
Application No.: US16487819

Application Date: 2018-01-26
Publication No.: US11120789B2

Publication Date: 2021-09-14
Inventor: Lichun Fan
Applicant: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
Applicant Address: CN Hangzhou
Assignee: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
Current Assignee: YUTOU TECHNOLOGY (HANGZHOU) CO., LTD.
Current Assignee Address: CN Hangzhou
Agency: Getech Law LLC
Agent Jun Ye
Priority: CN201710108893.5 20170227
International Application: PCT/CN2018/074320 WO 20180126
International Announcement: WO2018/153214 WO 20180830
Main IPC: G10L15/06
IPC: G10L15/06 ; G10L15/02 ; G10L15/14 ; G10L15/16 ; G10L25/21 ; G10L25/24

Training method of hybrid frequency acoustic recognition model, and speech recognition method

Abstract:

The invention discloses a training method and a speech recognition method for a mixed frequency acoustic recognition model, which belongs to the technical field of speech recognition. The method comprises: obtaining a first-type speech feature of the first speech signal, and processing the first speech data to obtain corresponding first speech training data (S1); obtaining the first-type speech feature of the second speech signal, and processing the second speech data to obtain corresponding second speech training data (S2); obtaining a second-type speech feature of the first speech signal according to a power spectrum of the first speech signal, and obtaining the second-type speech feature of the second speech signal according to a power spectrum of the second speech signal (S3); performing pre-training according to the first speech signal and the second speech signal, so as to form a preliminary recognition model of the hybrid frequency acoustic recognition model (S4); and performing supervised parameter training on the preliminary recognition model according to the first speech training data, the second speech training data and the second-type speech feature, so as to form the hybrid frequency acoustic recognition model (S5). The beneficial effects of the above technical solution are: the recognition model has better robustness and generalization.

Public/Granted literature

US20200380954A1 TRAINING METHOD OF HYBRID FREQUENCY ACOUSTIC RECOGNITION MODEL, AND SPEECH RECOGNITION METHOD Public/Granted day:2020-12-03

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/06	.创建基准模板；训练语音识别系统，例如对说话者声音特征的适应（G10L15/14优先）