-
Publication No.: US20170116988A1
Publication date: 2017-04-27
Application No.: US15365334
Filing date: 2016-11-30
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G10L15/02 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/285 , G10L17/22 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.
-
Publication No.: US09620114B2
Publication date: 2017-04-11
Application No.: US15299853
Filing date: 2016-10-21
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G10L15/02 , G10L15/08 , G10L15/18 , G10L15/1815 , G10L15/28 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data; determining that an initial portion of the audio data corresponds to an initial portion of a hotword; in response to determining that the initial portion of the audio data corresponds to the initial portion of the hotword, selecting, from among a set of one or more actions that are performed when the entire hotword is detected, a subset of the one or more actions; and causing one or more actions of the subset to be performed.
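The abstract above can be read as a prefix match followed by a filter over the full-hotword action set. The following is a minimal Python sketch of that reading, not the patented implementation. It makes several assumptions that are not in the source: text stands in for audio, and the hotword string, the action names, and the `safe_on_prefix` flag are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    name: str
    # Hypothetical flag: whether the action may run before the full hotword is heard.
    safe_on_prefix: bool


# Hypothetical hotword and action set; the abstract names neither.
HOTWORD = "ok computer"
FULL_HOTWORD_ACTIONS = [
    Action("wake_screen", safe_on_prefix=True),
    Action("load_recognizer", safe_on_prefix=True),
    Action("start_listening", safe_on_prefix=False),
]


def is_initial_portion(heard: str, hotword: str = HOTWORD) -> bool:
    """True when `heard` is a non-empty proper prefix of the hotword."""
    return 0 < len(heard) < len(hotword) and hotword.startswith(heard)


def select_actions(heard: str) -> list[Action]:
    """Select the subset of full-hotword actions to perform on a partial match."""
    if not is_initial_portion(heard):
        return []
    return [a for a in FULL_HOTWORD_ACTIONS if a.safe_on_prefix]


print([a.name for a in select_actions("ok")])  # ['wake_screen', 'load_recognizer']
```

In this sketch only the actions flagged as safe run on the partial match, so cheap preparation can start early while the step that commits to listening waits for the entire hotword.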
-
Publication No.: US09606716B2
Publication date: 2017-03-28
Application No.: US14522927
Filing date: 2014-10-24
Applicant: GOOGLE INC.
Inventor: Matthew Sharifi , David Petrou
IPC: G06F3/0486 , G06F3/16 , G06F9/54 , G06F3/0484 , G06F3/0488
CPC classification number: G06F3/0486 , G06F3/04842 , G06F3/0488 , G06F3/04883 , G06F3/167 , G06F9/543 , G06F2203/04803
Abstract: Implementations provide an improved drag-and-drop operation on a mobile device. For example, a method includes identifying a drag area in a user interface of a first mobile application in response to a drag command, identifying an entity from a data store based on recognition performed on content in the drag area, receiving a drop location associated with a second mobile application, determining an action to perform in the second mobile application based on the drop location, and performing the action in the second mobile application using the entity. Another method may include receiving a selection of a smart copy control for a text input control in a first mobile application, receiving a selected area of a display generated by a second mobile application, identifying an entity in the selected area, automatically navigating back to the text input control, and pasting a description of the entity in the text input control.
-
Publication No.: US09563671B2
Publication date: 2017-02-07
Application No.: US15189185
Filing date: 2016-06-22
Applicant: GOOGLE INC.
Inventor: Alfred Zalmon Spector , David Petrou , Blaise Aguera-Arcas , Matthew Sharifi
CPC classification number: G06F21/54 , G06F17/30539 , G06F17/30876 , G06F21/6218 , G06F2221/0724 , G06T1/0021 , G06T1/20 , G06T1/60 , G06T11/60
Abstract: Systems and methods prevent or restrict the mining of content on a mobile device. For example, a method may include identifying a mining-restriction mark in low order bits or high order bits in a frame buffer of a mobile device and determining whether the mining-restriction mark prevents mining of content. Mining includes non-transient storage of a copy or derivations of data in the frame buffer. The method may also include preventing the mining of data in the frame buffer when the mining-restriction mark prevents mining.
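As an illustration of a mark carried in low order bits, the sketch below writes a short bit pattern into the least significant bit of the first few pixel values of a frame and refuses to store any frame that carries it. This is only one possible reading. The 8-bit pattern, its position at the start of the buffer, the flat list-of-integers frame, and the rule that the mark's presence alone blocks mining are all assumptions not stated in the abstract.

```python
# Hypothetical 8-bit mark; the abstract does not specify the pattern or its location.
MARK_BITS = [1, 0, 1, 1, 0, 0, 1, 0]


def embed_mark(frame: list[int], bits: list[int] = MARK_BITS) -> list[int]:
    """Return a copy of `frame` with `bits` written into the lowest bit of the first pixels."""
    if len(frame) < len(bits):
        raise ValueError("frame is too small to hold the mark")
    marked = list(frame)
    for i, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the mark bit.
        marked[i] = (marked[i] & ~1) | bit
    return marked


def has_mark(frame: list[int], bits: list[int] = MARK_BITS) -> bool:
    """True when the low-order bits of the first pixels match the mark."""
    if len(frame) < len(bits):
        return False
    return all((frame[i] & 1) == bit for i, bit in enumerate(bits))


def mine(frame: list[int], store: list[list[int]]) -> bool:
    """Persist a copy of the frame unless the mark forbids it; report whether it was stored."""
    if has_mark(frame):
        return False
    store.append(list(frame))
    return True
```

Because only the lowest bit of each affected pixel changes, every marked value differs from the original by at most 1, which is why low order bits suit a mark that should not be visible on screen.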
-
Publication No.: US09542948B2
Publication date: 2017-01-10
Application No.: US14612830
Filing date: 2015-02-03
Applicant: Google Inc.
Inventor: Dominik Roblek , Matthew Sharifi , Raziel Alvarez Guevara
CPC classification number: G10L17/18 , G10L17/005
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker verification. The methods, systems, and apparatus include actions of inputting speech data that corresponds to a particular utterance to a first neural network and determining an evaluation vector based on output at a hidden layer of the first neural network. Additional actions include obtaining a reference vector that corresponds to a past utterance of a particular speaker. Further actions include inputting the evaluation vector and the reference vector to a second neural network that is trained on a set of labeled pairs of feature vectors to identify whether speakers associated with the labeled pairs of feature vectors are the same speaker. More actions include determining, based on an output of the second neural network, whether the particular utterance was likely spoken by the particular speaker.
-
Publication No.: US20170004132A1
Publication date: 2017-01-05
Application No.: US15267463
Filing date: 2016-09-16
Applicant: Google Inc.
Inventor: Matthew Sharifi
IPC: G06F17/30
CPC classification number: G06F16/487 , G06F16/245 , G06F16/2455 , G06F16/24578 , G06F16/433 , G06F16/435 , G06F16/437 , G06F16/489 , G06F16/685 , G06F16/7834 , G06F16/9535 , G06F16/955 , G06Q30/02 , G06Q30/0631 , G10L25/54
Abstract: Methods, systems, and apparatus for receiving a natural language query of a user, and environmental data, identifying a media item based on the environmental data, determining an entity type based on the natural language query, selecting an entity associated with the media item that matches the entity type, selecting, from a media consumption database that identifies media items that have been indicated as consumed by the user, one or more media items that have been indicated as consumed by the user and that are associated with the selected entity, and providing a response to the query based on selecting the one or more media items that have been indicated as consumed by the user and that are associated with the selected entity.
-
Publication No.: US09520130B2
Publication date: 2016-12-13
Application No.: US15001894
Filing date: 2016-01-20
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G06F3/04842 , G06F3/167 , G10L15/063 , G10L15/08 , G10L15/18 , G10L15/265 , G10L15/30 , G10L2015/0631 , G10L2015/0638 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword models that correspond to the candidate hotword; and providing the identified, pre-computed hotword models to the computing device.
-
Publication No.: US09514753B2
Publication date: 2016-12-06
Application No.: US14523198
Filing date: 2014-10-24
Applicant: Google Inc.
Inventor: Matthew Sharifi , Ignacio Lopez Moreno , Ludwig Schmidt
CPC classification number: G10L17/02 , G10L17/005 , G10L17/08 , G10L17/18 , G10L25/51
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing speaker identification. In some implementations, an utterance vector that is derived from an utterance is obtained. Hash values are determined for the utterance vector according to multiple different hash functions. A set of speaker vectors from a plurality of hash tables is determined using the hash values, where each speaker vector was derived from one or more utterances of a respective speaker. The speaker vectors in the set are compared with the utterance vector. A speaker vector is selected based on comparing the speaker vectors in the set with the utterance vector.
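The lookup described in this abstract resembles locality-sensitive hashing. The sketch below uses random-hyperplane hashing and cosine similarity, which are swapped-in choices: the abstract names neither the hash family nor the comparison metric. The `SpeakerIndex` class, its parameters, and the toy three-dimensional vectors are hypothetical, and vectors are assumed to be non-zero.

```python
import math
import random
from collections import defaultdict


def make_hash_function(dim: int, n_bits: int, rng: random.Random):
    """Build one hash function: the sign pattern of a vector against random hyperplanes."""
    planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]

    def hash_vector(vec):
        return tuple(sum(p * v for p, v in zip(plane, vec)) >= 0.0 for plane in planes)

    return hash_vector


def cosine(a, b) -> float:
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


class SpeakerIndex:
    """Hypothetical index: one hash table per hash function, keyed by hash value."""

    def __init__(self, dim: int, n_tables: int = 8, n_bits: int = 4, seed: int = 0):
        rng = random.Random(seed)
        self.hashes = [make_hash_function(dim, n_bits, rng) for _ in range(n_tables)]
        self.tables = [defaultdict(list) for _ in range(n_tables)]
        self.vectors = {}

    def add(self, speaker_id: str, vec) -> None:
        """Store a speaker vector under its hash value in every table."""
        self.vectors[speaker_id] = vec
        for h, table in zip(self.hashes, self.tables):
            table[h(vec)].append(speaker_id)

    def identify(self, utterance_vec):
        """Collect candidates from matching buckets, then pick the most similar one."""
        candidates = set()
        for h, table in zip(self.hashes, self.tables):
            candidates.update(table.get(h(utterance_vec), []))
        if not candidates:
            return None
        return max(candidates, key=lambda s: cosine(self.vectors[s], utterance_vec))


index = SpeakerIndex(dim=3)
index.add("alice", [1.0, 0.0, 0.0])
index.add("bob", [0.0, 1.0, 0.0])
print(index.identify([1.0, 0.0, 0.0]))  # alice
```

The utterance vector is compared only against speakers that share a bucket with it in at least one table, so the cost of the final comparison depends on the candidate set rather than on the total number of enrolled speakers.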
-
Publication No.: US20160343371A1
Publication date: 2016-11-24
Application No.: US15224944
Filing date: 2016-08-01
Applicant: Google Inc.
Inventor: Matthew Sharifi , Gheorghe Postelnicu
CPC classification number: G10L15/22 , G06F17/30026 , G06F17/30654 , G06F17/30684 , G06F17/30752 , G10L15/08 , G10L15/1815 , G10L15/24 , G10L15/30 , G10L2015/088 , G10L2015/223 , G10L2015/225
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data encoding an utterance and environmental data, obtaining a transcription of the utterance, identifying an entity using the environmental data, submitting a query to a natural language query processing engine, wherein the query includes at least a portion of the transcription and data that identifies the entity, and obtaining one or more results of the query.
-
Publication No.: US09502026B2
Publication date: 2016-11-22
Application No.: US14991092
Filing date: 2016-01-08
Applicant: Google Inc.
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G10L15/02 , G10L15/08 , G10L15/18 , G10L15/1815 , G10L15/28 , G10L25/51 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving audio data; determining that an initial portion of the audio data corresponds to an initial portion of a hotword; in response to determining that the initial portion of the audio data corresponds to the initial portion of the hotword, selecting, from among a set of one or more actions that are performed when the entire hotword is detected, a subset of the one or more actions; and causing one or more actions of the subset to be performed.