Segment-based speaker verification using dynamically generated phrases
    41.
    发明授权
    Segment-based speaker verification using dynamically generated phrases 有权
    使用动态生成的短语进行基于段的演讲者验证

    公开(公告)号:US08812320B1

    公开(公告)日:2014-08-19

    申请号:US14242098

    申请日:2014-04-01

    Applicant: Google Inc.

    CPC classification number: G10L17/24 G10L15/02 G10L17/04 G10L2015/025

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.

    Abstract translation: 方法,系统和装置,包括编码在计算机存储介质上的计算机程序,用于验证用户的身份。 方法,系统和装置包括接收用于验证用户身份的验证短语的请求的动作。 附加动作包括响应于接收到用于验证用户身份的验证短语的请求,识别要包括在验证短语中的子词,并且响应于识别要包括在验证短语中的子词,获得候选短语 其包括至少一些所识别的子词作为验证短语。 进一步的操作包括提供验证短语作为对用于验证用户身份的验证短语的请求的响应。

    Synchronized content playback related to content recognition
    42.
    发明授权
    Synchronized content playback related to content recognition 有权
    与内容识别相关的同步内容播放

    公开(公告)号:US08699862B1

    公开(公告)日:2014-04-15

    申请号:US13760238

    申请日:2013-02-06

    Applicant: Google Inc.

    CPC classification number: G11B27/10 G06F17/3074 G11B27/28 H04N9/8211

    Abstract: Systems, methods, routines and/or techniques for synchronized content playback related to content recognition are described. A software program may cause a video to play synchronously with a song, for example, a song that is playing in an ambient environment such as a café or bar. In some embodiments, a client device may sense a song and the client device may communicate audio data related to the song to a remote server, and the remote server may identify a song that is related to the audio data. The remote server may also identify one or more videos (e.g., in a video database) that relate to the song. The remote server may communicate one or more of the videos (e.g., a link/URL) back to the client device such that the client device can play one of the videos synchronously with the song, even if playback of the video is delayed.

    Abstract translation: 描述了与内容识别相关的同步内容回放的系统,方法,例程和/或技术。 软件程序可以使视频与歌曲同步地播放,例如,在诸如咖啡馆或酒吧的环境环境中播放的歌曲。 在一些实施例中,客户端设备可以感测歌曲,并且客户端设备可以将与歌曲相关的音频数据传送到远程服务器,并且远程服务器可以标识与音频数据相关的歌曲。 远程服务器还可以标识与歌曲相关的一个或多个视频(例如,在视频数据库中)。 远程服务器可以将一个或多个视频(例如,链接/ URL)传送回客户端设备,使得即使视频的播放被延迟,客户端设备也可以与歌曲同步地播放视频之一。

    Systems and Methods for Live Media Content Matching
    43.
    发明申请
    Systems and Methods for Live Media Content Matching 有权
    Live Media内容匹配的系统和方法

    公开(公告)号:US20140082651A1

    公开(公告)日:2014-03-20

    申请号:US13623031

    申请日:2012-09-19

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.

    Abstract translation: 公开了用于匹配实时媒体内容的系统和方法。 在服务器上,从客户端设备获取第一媒体内容,这里,第一媒体内容对应于正在客户端设备上播放的媒体内容的一部分,并且第一媒体内容与预定义的到期时间相关联; 从一个或多个内容馈送获得第二媒体内容,其中所述第二媒体内容还对应于在所述客户端设备上播放的所述媒体内容的一部分; 根据确定所述第二媒体内容对应于已经在所述客户端设备上播放的所述媒体内容的一部分:在所述预定到期时间之前,获得与在所述客户端设备上播放的所述媒体内容相对应的第三媒体内容,从 一个或多个内容馈送; 以及将所述第一媒体内容与所述第三媒体内容进行比较。

    Identifying media content
    44.
    发明授权

    公开(公告)号:US08484017B1

    公开(公告)日:2013-07-09

    申请号:US13626351

    申请日:2012-09-25

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.

    Modeling personal entities on a mobile device using embeddings

    公开(公告)号:US10803391B2

    公开(公告)日:2020-10-13

    申请号:US14812877

    申请日:2015-07-29

    Applicant: GOOGLE INC.

    Abstract: Systems and methods are provided for a personal entity modeling for computing devices. For example, a computing device comprises at least one processor and memory storing instructions that, when executed by the at least one processor, cause the mobile device to perform operations including identifying a personal entity in content generated for display on the mobile device, generating training examples for the personal entity from the content, and updating an embedding used to model the personal entity using the training examples. The embedding may be used to make predictions regarding the personal entity. For example, the operations may also include predicting an association between a first personal entity displayed on the computing device and a second entity based on the embedding, and providing a recommendation, to be displayed on the computing device, related to the second entity.

    STRUCTURED RESPONSE SUMMARIZATION OF ELECTRONIC MESSAGES

    公开(公告)号:US20180232127A1

    公开(公告)日:2018-08-16

    申请号:US15433587

    申请日:2017-02-15

    Applicant: Google Inc.

    Abstract: A system and method for grouping and organizing structured responses in a communication application at a computing device. A structured question in a plurality of messages can be detected based on a structured question model trained via machine learning. A structured question can be a question predicted by the structured question model to have a number of possible answers fewer than a threshold. A user interface element, corresponding to the structured question, can include a structured summarization that includes one or more answers to the structured question present in the plurality of messages from the plurality of users, and/or a structured response template in which at least a subset of possible answers are presented and are selectable. A command to include the generated graphical user interface element in a record of the communication session in a graphical user interface corresponding to the communication application.

    VIDEO PLAYLISTS AND RECOMMENDATIONS BASED ON ELECTRONIC MESSAGING COMMUNICATIONS

    公开(公告)号:US20180183739A1

    公开(公告)日:2018-06-28

    申请号:US15391074

    申请日:2016-12-27

    Applicant: Google Inc.

    CPC classification number: H04L67/306 H04L51/10 H04L51/20 H04L51/32

    Abstract: A system and method includes receiving, by a server system from a first user device executing a first instance of a messaging application, a first message for a user of a second user device executing a second instance of the messaging application. The method also includes determining whether the first message includes a first reference to a first media item. The method includes responsive to determining that the first message includes the first reference to the first media item, generating media playlist information identifying the first media item. The method further includes sending the media playlist information identifying the first media item to a content sharing platform, the first media item to be added to a media playlist maintained by the content sharing platform.

    Adaptive text-to-speech outputs
    48.
    发明授权

    公开(公告)号:US09886942B2

    公开(公告)日:2018-02-06

    申请号:US15477360

    申请日:2017-04-03

    Applicant: Google Inc.

    CPC classification number: G10L13/043 G06F17/274 G06F17/2775 G10L13/08

    Abstract: In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.

Patent Agency Ranking