FREQUENCY BASED AUDIO ANALYSIS USING NEURAL NETWORKS

    公开(公告)号:US20170330586A1

    公开(公告)日:2017-11-16

    申请号:US15151362

    申请日:2016-05-10

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.

    ADAPTIVE ARTIFICIAL NEURAL NETWORK SELECTION TECHNIQUES

    公开(公告)号:US20170277994A1

    公开(公告)日:2017-09-28

    申请号:US15082653

    申请日:2016-03-28

    Applicant: Google Inc.

    Abstract: Computer-implemented techniques can include obtaining, by a client computing device, a digital media item and a request for a processing task on the digital item and determining a set of operating parameters based on (i) available computing resources at the client computing device and (ii) a condition of a network. Based on the set of operating parameters, the client computing device or a server computing device can select one of a plurality of artificial neural networks (ANNs), each ANN defining which portions of the processing task are to be performed by the client and server computing devices. The client and server computing devices can coordinate processing of the processing task according to the selected ANN. The client computing device can also obtain final processing results corresponding to a final evaluation of the processing task and generate an output based on the final processing results.

    SYSTEMS AND METHODS FOR LIVE MEDIA CONTENT MATCHING

    公开(公告)号:US20170257650A1

    公开(公告)日:2017-09-07

    申请号:US15603357

    申请日:2017-05-23

    Applicant: GOOGLE INC.

    Inventor: Matthew Sharifi

    Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.

    ADAPTIVE TEXT-TO-SPEECH OUTPUTS
    134.
    发明申请

    公开(公告)号:US20170221472A1

    公开(公告)日:2017-08-03

    申请号:US15477360

    申请日:2017-04-03

    Applicant: Google Inc.

    CPC classification number: G10L13/043 G06F17/274 G06F17/2775 G10L13/08

    Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.

    Dual model speaker identification
    136.
    发明授权

    公开(公告)号:US09711148B1

    公开(公告)日:2017-07-18

    申请号:US13944975

    申请日:2013-07-18

    Applicant: Google Inc.

    CPC classification number: G10L17/02 G10L17/10 G10L17/22

    Abstract: A processing system receives an audio signal encoding an utterance and determines that a first portion of the audio signal corresponds to a predefined phrase. The processing system accesses one or more text-dependent models associated with the predefined phrase and determines a first confidence based on the one or more text-dependent models associated with the predefined phrase, the first confidence corresponding to a first likelihood that a particular speaker spoke the utterance. The processing system determines a second confidence for a second portion of the audio signal using one or more text-independent models, the second confidence corresponding to a second likelihood that the particular speaker spoke the utterance. The processing system then determines that the particular speaker spoke the utterance based at least in part on the first confidence and the second confidence.

    Wireless signal forwarding
    138.
    发明授权

    公开(公告)号:US09699597B2

    公开(公告)日:2017-07-04

    申请号:US14961803

    申请日:2015-12-07

    Applicant: GOOGLE INC.

    CPC classification number: H04W4/80 G06Q20/3278 H04B5/0031 H04W40/244

    Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.

    WIRELESS SIGNAL FORWARDING
    139.
    发明申请

    公开(公告)号:US20170164139A1

    公开(公告)日:2017-06-08

    申请号:US14961803

    申请日:2015-12-07

    Applicant: GOOGLE INC.

    CPC classification number: H04W4/80 G06Q20/3278 H04B5/0031 H04W40/244

    Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.

Patent Agency Ranking