-
公开(公告)号:US20170330586A1
公开(公告)日:2017-11-16
申请号:US15151362
申请日:2016-05-10
Applicant: Google Inc.
Inventor: Dominik Roblek , Matthew Sharifi
CPC classification number: G10L25/30 , G06F11/0715 , G06F11/079 , G06N3/0454 , G06N3/084 , G06N3/088
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for frequency based audio analysis using neural networks. One of the methods includes training a neural network that includes a plurality of neural network layers on training data, wherein the neural network is configured to receive frequency domain features of an audio sample and to process the frequency domain features to generate a neural network output for the audio sample, wherein the neural network comprises (i) a convolutional layer that is configured to map frequency domain features to logarithmic scaled frequency domain features, wherein the convolutional layer comprises one or more convolutional layer filters, and (ii) one or more other neural network layers having respective layer parameters that are configured to process the logarithmic scaled frequency domain features to generate the neural network output.
-
公开(公告)号:US20170277994A1
公开(公告)日:2017-09-28
申请号:US15082653
申请日:2016-03-28
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G06N3/08 , G06F9/5044 , G06F9/505 , G06F9/5094 , G06N3/0454 , H04L67/42 , Y02D10/22
Abstract: Computer-implemented techniques can include obtaining, by a client computing device, a digital media item and a request for a processing task on the digital item and determining a set of operating parameters based on (i) available computing resources at the client computing device and (ii) a condition of a network. Based on the set of operating parameters, the client computing device or a server computing device can select one of a plurality of artificial neural networks (ANNs), each ANN defining which portions of the processing task are to be performed by the client and server computing devices. The client and server computing devices can coordinate processing of the processing task according to the selected ANN. The client computing device can also obtain final processing results corresponding to a final evaluation of the processing task and generate an output based on the final processing results.
-
公开(公告)号:US20170257650A1
公开(公告)日:2017-09-07
申请号:US15603357
申请日:2017-05-23
Applicant: GOOGLE INC.
Inventor: Matthew Sharifi
IPC: H04N21/235 , H04N21/25 , H04N21/466 , H04N21/84 , H04N21/234
CPC classification number: H04N21/2353 , H04N21/23418 , H04N21/251 , H04N21/4668 , H04N21/84
Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.
-
公开(公告)号:US20170221472A1
公开(公告)日:2017-08-03
申请号:US15477360
申请日:2017-04-03
Applicant: Google Inc.
Inventor: Matthew Sharifi , Jakob Nicolaus Foerster
CPC classification number: G10L13/043 , G06F17/274 , G06F17/2775 , G10L13/08
Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
-
公开(公告)号:US09720955B1
公开(公告)日:2017-08-01
申请号:US15289661
申请日:2016-10-10
Applicant: Google Inc.
Inventor: Jing Cao , Alexa Greenberg , Abhanshu Sharma , Yanchao Su , Nicholas Kong , Muhammad Mohsin , Jacek Jurewicz , Wei Huang , Matthew Sharifi , Benjamin Sidhom
IPC: G06F3/00 , G06F17/30 , G06F3/0488 , G06F3/0482
CPC classification number: H04L51/046 , G06F3/0237 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/30398 , G06F17/30554 , G06F17/30643 , G06F17/30864 , G06F17/30867 , G06F17/30973
Abstract: A computing device is described that includes at least one processor and a memory including instructions that when executed cause the at least one processor to output, for display, a graphical keyboard comprising a plurality of keys, and determine, based on an indication of a selection of one or more keys from the plurality of keys, text of an electronic communication. The instructions, when executed, further cause the at least one processor to identify, based at least in part on the text, a searchable entity or trigger phrase, generate, based on the searchable entity or trigger phrase, a search query, and output, for display, within the graphical keyboard, a graphical indication to indicate that the computing device generated the search query.
-
公开(公告)号:US09711148B1
公开(公告)日:2017-07-18
申请号:US13944975
申请日:2013-07-18
Applicant: Google Inc.
Inventor: Matthew Sharifi , Dominik Roblek
IPC: G10L17/02
Abstract: A processing system receives an audio signal encoding an utterance and determines that a first portion of the audio signal corresponds to a predefined phrase. The processing system accesses one or more text-dependent models associated with the predefined phrase and determines a first confidence based on the one or more text-dependent models associated with the predefined phrase, the first confidence corresponding to a first likelihood that a particular speaker spoke the utterance. The processing system determines a second confidence for a second portion of the audio signal using one or more text-independent models, the second confidence corresponding to a second likelihood that the particular speaker spoke the utterance. The processing system then determines that the particular speaker spoke the utterance based at least in part on the first confidence and the second confidence.
-
公开(公告)号:US20170193998A1
公开(公告)日:2017-07-06
申请号:US15466979
申请日:2017-03-23
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G10L15/02 , G10L15/08 , G10L15/18 , G10L15/1815 , G10L15/28 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, receiving audio data; determining that an initial portion of the audio data corresponds to an initial portion of a hotword; in response to determining that the initial portion of the audio data corresponds to the initial portion of the hotword, selecting, from among a set of one or more actions that are performed when the entire hotword is detected, a subset of the one or more actions; and causing one or more actions of the subset to be performed.
-
公开(公告)号:US09699597B2
公开(公告)日:2017-07-04
申请号:US14961803
申请日:2015-12-07
Applicant: GOOGLE INC.
Inventor: Thomas Deselaers , Daniel Martin Keysers , Stephan Robert Gammeter , Matthew Sharifi
CPC classification number: H04W4/80 , G06Q20/3278 , H04B5/0031 , H04W40/244
Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.
-
公开(公告)号:US20170164139A1
公开(公告)日:2017-06-08
申请号:US14961803
申请日:2015-12-07
Applicant: GOOGLE INC.
Inventor: Thomas Deselaers , Daniel Martin Keysers , Stephan Robert Gammeter , Matthew Sharifi
CPC classification number: H04W4/80 , G06Q20/3278 , H04B5/0031 , H04W40/244
Abstract: Forwarding wireless signals comprises a user and a counterpart opening secure applications on a user computing device and a counterpart computing device, respectively. The user places the user computing device within range of a wireless signal, such as a wireless signal provided by a point of sale (“POS”) terminal. The user computing device forwards the wireless signal from the POS terminal to the counterpart computing device. The user computing device forwards the wireless signal from the counterpart computing device to the POS terminal. Thus, the counterpart computing device may conduct a transaction with the POS terminal as if the counterpart computing device were at the location of the POS terminal. The counterpart computing device may also receive a forwarded beacon signal comprising data, such as an offer, provided by the POS terminal or another suitable beacon transmission device at the merchant location.
-
公开(公告)号:US20170133014A1
公开(公告)日:2017-05-11
申请号:US15410180
申请日:2017-01-19
Applicant: Google Inc.
Inventor: Matthew Sharifi , Gheorghe Postelnicu
CPC classification number: G10L15/22 , G06F17/30026 , G06F17/30654 , G06F17/30684 , G06F17/30752 , G10L15/08 , G10L15/1815 , G10L15/24 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/225
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance and environmental data, obtaining a transcription of the utterance, identifying an entity using the environmental data, submitting a query to a natural language query processing engine, wherein the query includes at least a portion of the transcription and data that identifies the entity, and obtaining one or more results of the query.
-
-
-
-
-
-
-
-
-