Invention Grant
- Patent Title: Frequency warping in a speech recognition system
-
Application No.: US15221491Application Date: 2016-07-27
-
Publication No.: US10026396B2Publication Date: 2018-07-17
- Inventor: Andrew W. Senior
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G10L15/16
- IPC: G10L15/16 ; G10L15/02 ; G10L15/06 ; G10L25/30 ; G10L21/013

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a sequence representing an utterance, the sequence comprising a plurality of audio frames; determining one or more warping factors for each audio frame in the sequence using a warping neural network; applying, for each audio frame, the one or more warping factors for the audio frame to the audio frame to generate a respective modified audio frame, wherein the applying comprises using at least one of the warping factors to scale a respective frequency of the audio frame to a new respective frequency in the respective modified audio frame; and decoding the modified audio frames using a decoding neural network, wherein the decoding neural network is configured to output a word sequence that is a transcription of the utterance.
Public/Granted literature
- US20170032802A1 FREQUENCY WARPING IN A SPEECH RECOGNITION SYSTEM Public/Granted day:2017-02-02
Information query