-
Publication number: US09812123B1
Publication date: 2017-11-07
Application number: US14825648
Application date: 2015-08-13
Applicant: Google Inc.
Inventor: Jason Sanders, Gabriel Taubman, John J. Lee
CPC classification number: G10L15/08, G06F17/30746, G10L15/1815, G10L15/22, G10L15/265, G10L21/0208, G10L21/0272, G10L25/48, G10L2015/225, H04M3/4936, H04M2201/40, H04M2203/352
Abstract: Implementations relate to techniques for providing context-dependent search results. A computer-implemented method includes receiving an audio stream at a computing device during a time interval, the audio stream comprising user speech data and background audio, separating the audio stream into a first substream that includes the user speech data and a second substream that includes the background audio, identifying concepts related to the background audio, generating a set of terms related to the identified concepts, influencing a speech recognizer based on at least one of the terms related to the background audio, and obtaining a recognized version of the user speech data using the speech recognizer.
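Illustrative sketch (not taken from the patent): one simple way to "influence a speech recognizer" with terms derived from background audio is to re-rank the recognizer's candidate transcriptions, boosting candidates that contain those terms. The function name `rescore`, the `boost` parameter, and the sample scores are all hypothetical; the patent does not specify this mechanism.

```python
def rescore(hypotheses, context_terms, boost=0.5):
    """Re-rank (text, score) hypotheses.

    Each word that matches a context term (assumed to come from the
    background-audio substream) adds `boost` to the hypothesis score.
    """
    terms = {t.lower() for t in context_terms}
    rescored = []
    for text, score in hypotheses:
        hits = sum(1 for word in text.lower().split() if word in terms)
        rescored.append((text, score + boost * hits))
    # Highest score first.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)


# Hypothetical recognizer output: log-probability-like scores.
hyps = [("play some jazz", -1.0), ("play some jets", -0.8)]
# Hypothetical terms generated from concepts in the background audio.
best = rescore(hyps, ["jazz", "saxophone"])[0][0]
```

With the sample values above, the term match lifts "play some jazz" above the acoustically preferred "play some jets".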
-
Publication number: US09767801B1
Publication date: 2017-09-19
Application number: US15395450
Application date: 2016-12-30
Applicant: Google Inc.
Inventor: Jason Sanders, Gabriel Taubman
CPC classification number: G10L15/222, G10L15/01, G10L15/22, G10L2015/221, G10L2015/223, G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for intelligently cancelling user inputs. In one aspect, a request input by a user is received by a dialog engine. A prompt or notification regarding the request is output by the dialog engine. That the user has taken an action in response to the prompt or notification is determined by the dialog engine. Based on the action taken by the user, that the response corresponds to a potential cancellation command is determined by the dialog system.
-
Publication number: US09570086B1
Publication date: 2017-02-14
Application number: US13676283
Application date: 2012-11-14
Applicant: Google Inc.
Inventor: Jason Sanders, Gabriel Taubman
CPC classification number: G10L15/222, G10L15/01, G10L15/22, G10L2015/221, G10L2015/223, G10L2015/228
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for intelligently cancelling user inputs. In one aspect, a request input by a user is received by a dialog engine. A prompt or notification regarding the request is output by the dialog engine. That the user has taken an action in response to the prompt or notification is determined by the dialog engine. Based on the action taken by the user, that the response corresponds to a potential cancellation command is determined by the dialog system.
-
Publication number: US09529793B1
Publication date: 2016-12-27
Application number: US13774082
Application date: 2013-02-22
Applicant: Google Inc.
Inventor: Gabriel Taubman, John J. Lee
IPC: G06F17/27
CPC classification number: G06F17/274, G06F17/30761, G06F17/30864, G10L15/187, G10L25/87
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for resolving ambiguity in received voice queries. An original voice query is received following one or more earlier voice queries, wherein the original voice query includes a pronoun or phrase. In one implementation, a plurality of acoustic parameters is identified for one or more words in the original voice query. A concept represented by the pronoun is identified based on the plurality of acoustic parameters, wherein the concept is associated with a particular query of the one or more earlier queries. The concept is associated with the pronoun. Alternatively, a concept may be associated with a phrase by using grammatical analysis of the query to relate the phrase to a concept derived from a prior query.
-
Publication number: US20150169640A1
Publication date: 2015-06-18
Application number: US14301154
Application date: 2014-06-10
Applicant: Google Inc.
Inventor: Ulrich Buddemeier, Gabriel Taubman, Hartwig Adam, Charles J. Rosenberg, Hartmut Neven, David Petrou, Fernando Brucher
CPC classification number: G06F17/30277, G06F17/30256, G06F17/30268, G06F17/3053, G06F17/30554, G06K9/6215, G06K9/6282, G06K9/723, G06K2209/01
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing queries made up of images. In one aspect, a method includes indexing images by image descriptors. The method further includes associating descriptive n-grams with the images. In another aspect, a method includes receiving a query, identifying text describing the query, and performing a search according to the text identified for the query.
-
Publication number: US08831957B2
Publication date: 2014-09-09
Application number: US13651566
Application date: 2012-10-15
Applicant: Google Inc.
Inventor: Gabriel Taubman, Brian Strope
IPC: G10L21/00
CPC classification number: G10L15/30, G10L15/183, G10L2015/0635, G10L2015/226, H04M1/72572
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing speech recognition using models that are based on where, within a building, a speaker makes an utterance are disclosed. The methods, systems, and apparatus include actions of receiving data corresponding to an utterance, and obtaining location indicia for an area within a building where the utterance was spoken. Further actions include selecting one or more models for speech recognition based on the location indicia, wherein each of the selected one or more models is associated with a weight based on the location indicia. Additionally, the actions include generating a composite model using the selected one or more models and the respective weights of the selected one or more models. And the actions also include generating a transcription of the utterance using the composite model.
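Illustrative sketch (not taken from the patent): the composite-model step in the abstract above can be pictured as a weighted interpolation of per-location models. Here each model is assumed to be a simple word-probability table, and the weights are assumed to come from location indicia; the names `composite_model`, `kitchen`, and `living_room` and all numbers are hypothetical.

```python
def composite_model(models, weights):
    """Linearly interpolate word-probability tables.

    models:  {location: {word: probability}}
    weights: {location: weight derived from location indicia}
    Weights are normalized so the result remains a distribution.
    """
    total = sum(weights.values())
    combined = {}
    for location, weight in weights.items():
        for word, prob in models[location].items():
            combined[word] = combined.get(word, 0.0) + (weight / total) * prob
    return combined


# Hypothetical per-room models and location-based weights.
models = {
    "kitchen": {"oven": 0.6, "timer": 0.4},
    "living_room": {"tv": 0.7, "timer": 0.3},
}
weights = {"kitchen": 0.75, "living_room": 0.25}
model = composite_model(models, weights)
```

A real recognizer would combine full language or acoustic models rather than unigram tables; the sketch only shows how location-based weights could shape the combined model.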
-
Publication number: US08686924B2
Publication date: 2014-04-01
Application number: US13769784
Application date: 2013-02-18
Applicant: Google Inc.
Inventor: Max Braun, Ryan Geiss, Harvey Ho, Thad Eugene Starner, Gabriel Taubman
IPC: G09G5/00
CPC classification number: G09G5/00, G02B27/017, G02B2027/014, G02B2027/0178, G02C11/10, G06F1/3231, G06F1/325, G06F3/03547, H04M1/05, H04M1/6066, H04M2250/12, Y02D10/173, Y02D50/20
Abstract: Systems and methods for selecting an action associated with a power state transition of a head-mounted display (HMD) in the form of eyeglasses are disclosed. A signal may be received from a sensor on a nose bridge of the eyeglasses indicating if the HMD is in use. Based on the received signal, a first power state for the HMD may be determined. Responsive to the determined first power state, an action associated with a power state transition of the HMD from an existing power state to the first power state may be selected. The action may be selected from among a plurality of actions associated with a plurality of state transitions. Also, the action may be a sequence of functions performed by the HMD including modifying an operating state of a primary processing component of the HMD and a detector of the HMD configured to image an environment.
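Illustrative sketch (not taken from the patent): selecting an action for a power state transition can be modeled as a lookup keyed by the (existing state, new state) pair, where the new state is determined from the nose-bridge sensor signal. The state names, the boolean sensor signal, and the function names in each action sequence are hypothetical.

```python
from enum import Enum


class PowerState(Enum):
    OFF_HEAD = "off_head"
    ON_HEAD = "on_head"


# Hypothetical mapping: (existing state, new state) -> sequence of functions.
ACTIONS = {
    (PowerState.OFF_HEAD, PowerState.ON_HEAD): ["wake_processor", "enable_camera"],
    (PowerState.ON_HEAD, PowerState.OFF_HEAD): ["disable_camera", "sleep_processor"],
}


def determine_state(nose_bridge_contact):
    """Map the (assumed boolean) nose-bridge sensor signal to a power state."""
    return PowerState.ON_HEAD if nose_bridge_contact else PowerState.OFF_HEAD


def select_action(current, nose_bridge_contact):
    """Return the target state and the action sequence for the transition.

    An empty list means no transition is needed.
    """
    target = determine_state(nose_bridge_contact)
    return target, ACTIONS.get((current, target), [])


state, action = select_action(PowerState.OFF_HEAD, True)
```

Keying the table on the full transition, rather than on the target state alone, lets different starting states trigger different action sequences.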
-
Publication number: US09201903B2
Publication date: 2015-12-01
Application number: US14301154
Application date: 2014-06-10
Applicant: Google Inc.
Inventor: Ulrich Buddemeier, Gabriel Taubman, Hartwig Adam, Charles J. Rosenberg, Hartmut Neven, David Petrou, Fernando Brucher
CPC classification number: G06F17/30277, G06F17/30256, G06F17/30268, G06F17/3053, G06F17/30554, G06K9/6215, G06K9/6282, G06K9/723, G06K2209/01
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing queries made up of images. In one aspect, a method includes indexing images by image descriptors. The method further includes associating descriptive n-grams with the images. In another aspect, a method includes receiving a query, identifying text describing the query, and performing a search according to the text identified for the query.
-
Publication number: US09123338B1
Publication date: 2015-09-01
Application number: US13804986
Application date: 2013-03-14
Applicant: Google Inc.
Inventor: Jason Sanders, Gabriel Taubman, John J. Lee
IPC: G10L15/20, H04M3/493, G10L15/26, G10L21/0208
CPC classification number: G10L15/08, G06F17/30746, G10L15/1815, G10L15/22, G10L15/265, G10L21/0208, G10L21/0272, G10L25/48, G10L2015/225, H04M3/4936, H04M2201/40, H04M2203/352
Abstract: Implementations relate to techniques for providing context-dependent search results. A computer-implemented method includes receiving an audio stream at a computing device during a time interval, the audio stream comprising user speech data and background audio, separating the audio stream into a first substream that includes the user speech data and a second substream that includes the background audio, identifying concepts related to the background audio, generating a set of terms related to the identified concepts, influencing a speech recognizer based on at least one of the terms related to the background audio, and obtaining a recognized version of the user speech data using the speech recognizer.
-
Publication number: US09792304B1
Publication date: 2017-10-17
Application number: US14946027
Application date: 2015-11-19
Applicant: Google Inc.
Inventor: Ulrich Buddemeier, Gabriel Taubman, Hartwig Adam, Charles J. Rosenberg, Hartmut Neven, David Petrou, Fernando Brucher
IPC: G06F17/30
CPC classification number: G06F17/30277, G06F17/30256, G06F17/30268, G06F17/3053, G06F17/30554, G06K9/6215, G06K9/6282, G06K9/723, G06K2209/01
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing queries made up of images. In one aspect, a method includes indexing images by image descriptors. The method further includes associating descriptive n-grams with the images. In another aspect, a method includes receiving a query, identifying text describing the query, and performing a search according to the text identified for the query.