Frequency warping in a speech recognition system

Invention Grant

US10026396B2 Frequency warping in a speech recognition system (In force)

Patent Title: Frequency warping in a speech recognition system
Application No.: US15221491

Application Date: 2016-07-27
Publication No.: US10026396B2

Publication Date: 2018-07-17
Inventor: Andrew W. Senior
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
Agency: Fish & Richardson P.C.
Main IPC: G10L15/16
IPC: G10L15/16 ; G10L15/02 ; G10L15/06 ; G10L25/30 ; G10L21/013

Abstract:

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a sequence representing an utterance, the sequence comprising a plurality of audio frames; determining one or more warping factors for each audio frame in the sequence using a warping neural network; applying, for each audio frame, the one or more warping factors for the audio frame to the audio frame to generate a respective modified audio frame, wherein the applying comprises using at least one of the warping factors to scale a respective frequency of the audio frame to a new respective frequency in the respective modified audio frame; and decoding the modified audio frames using a decoding neural network, wherein the decoding neural network is configured to output a word sequence that is a transcription of the utterance.
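The warping step in the abstract can be illustrated with a minimal sketch. It rests on several assumptions not stated in the abstract: each audio frame is a magnitude spectrum over linearly spaced frequency bins, a single factor is used per frame, and warping is a plain linear rescaling of the frequency axis. The function `toy_warping_factors` and its `weights` and `max_shift` parameters are hypothetical stand-ins for the warping neural network, not the patented model, and the decoding neural network is left out entirely.

```python
import numpy as np

def warp_frame(spectrum, alpha):
    """Rescale the frequency axis of one frame so energy at bin k moves to about alpha * k."""
    bins = np.arange(len(spectrum), dtype=float)
    # Output bin k reads the input at position k / alpha (linear interpolation).
    # Positions beyond the last input bin are filled with 0.
    return np.interp(bins / alpha, bins, spectrum, right=0.0)

def toy_warping_factors(frames, weights, max_shift=0.2):
    """Hypothetical stand-in for the warping network (assumption, not the patent's model).

    Returns one factor per frame, bounded to (1 - max_shift, 1 + max_shift).
    """
    return 1.0 + max_shift * np.tanh(frames @ weights)

def warp_utterance(frames, weights):
    """Apply a per-frame warping factor to every frame of an utterance."""
    factors = toy_warping_factors(frames, weights)
    return np.stack([warp_frame(f, a) for f, a in zip(frames, factors)])

# Usage: 5 frames of 64 spectral bins each, with small random weights.
rng = np.random.default_rng(0)
frames = np.abs(rng.normal(size=(5, 64)))
weights = 0.01 * rng.normal(size=64)
modified = warp_utterance(frames, weights)  # same shape as frames
```

In a full system along the lines of the abstract, the modified frames would then be passed to the decoding network to produce the transcription.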

Public/Granted literature

US20170032802A1 FREQUENCY WARPING IN A SPEECH RECOGNITION SYSTEM Public/Granted day: 2017-02-02

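IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/08	.Speech classification or search
G10L15/16	..Using artificial neural networks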