Artificial intelligence-based cross-language speech transcription method and apparatus, device and readable medium using Fbank40 acoustic feature format

Invention Grant

US10796700B2 Artificial intelligence-based cross-language speech transcription method and apparatus, device and readable medium using Fbank40 acoustic feature format 有权

Please log in to see more content

Patent Title: Artificial intelligence-based cross-language speech transcription method and apparatus, device and readable medium using Fbank40 acoustic feature format
Application No.: US15978465

Application Date: 2018-05-14
Publication No.: US10796700B2

Publication Date: 2020-10-06
Inventor: Wei Zou , Xiangang Li , Bin Huang
Applicant: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Applicant Address: CN Beijing
Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) CO., LTD.
Current Assignee Address: CN Beijing
Agency: Brooks Kushman PC
Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@7fd88534
Main IPC: G06F17/28
IPC: G06F17/28 ; G10L15/02 ; G11B20/00 ; G11B20/10 ; H04L12/54 ; G10L15/26 ; G10L15/22 ; G06F40/42 ; G06F40/47 ; G06F40/51 ; G10L15/06

Artificial intelligence-based cross-language speech transcription method and apparatus, device and readable medium using Fbank40 acoustic feature format

Abstract:

An artificial intelligence-based cross-language speech transcription method and apparatus, a device and a readable medium. The method includes pre-processing to-be-transcribed speech data to obtain multiple acoustic features, the to-be-transcribed speech data being represented in a first language; predicting a corresponding translation text after transcription of the speech data according to the multiple acoustic features and a pre-trained cross-language transcription model; wherein the translation text is represented in a second language which is different from the first language. According to the technical solution, it is unnecessary, upon cross-language speech transcription, to perform speech recognition first and then perform machine translation, but to directly perform cross-language transcription according to the pre-trained cross-language transcription model. The technical solution can overcome the problem of error accumulation in the two-step cross-language transcription manner in the prior art, and can effectively improve accuracy and efficiency of the cross-language speech transcription as compared with the prior art.

Public/Granted literature

US20180336900A1 Artificial Intelligence-Based Cross-Language Speech Transcription Method and Apparatus, Device and Readable Medium Public/Granted day:2018-11-22

Information query

Espacenet