Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.
Abstract:
Systems, methods, routines and/or techniques for synchronized content playback related to content recognition are described. A software program may cause a video to play synchronously with a song, for example, a song that is playing in an ambient environment such as a café or bar. In some embodiments, a client device may sense a song and the client device may communicate audio data related to the song to a remote server, and the remote server may identify a song that is related to the audio data. The remote server may also identify one or more videos (e.g., in a video database) that relate to the song. The remote server may communicate one or more of the videos (e.g., a link/URL) back to the client device such that the client device can play one of the videos synchronously with the song, even if playback of the video is delayed.
Abstract:
Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.
Abstract:
Systems and methods are provided for a personal entity modeling for computing devices. For example, a computing device comprises at least one processor and memory storing instructions that, when executed by the at least one processor, cause the mobile device to perform operations including identifying a personal entity in content generated for display on the mobile device, generating training examples for the personal entity from the content, and updating an embedding used to model the personal entity using the training examples. The embedding may be used to make predictions regarding the personal entity. For example, the operations may also include predicting an association between a first personal entity displayed on the computing device and a second entity based on the embedding, and providing a recommendation, to be displayed on the computing device, related to the second entity.
Abstract:
A system and method for grouping and organizing structured responses in a communication application at a computing device. A structured question in a plurality of messages can be detected based on a structured question model trained via machine learning. A structured question can be a question predicted by the structured question model to have a number of possible answers fewer than a threshold. A user interface element, corresponding to the structured question, can include a structured summarization that includes one or more answers to the structured question present in the plurality of messages from the plurality of users, and/or a structured response template in which at least a subset of possible answers are presented and are selectable. A command to include the generated graphical user interface element in a record of the communication session in a graphical user interface corresponding to the communication application.
Abstract:
A system and method includes receiving, by a server system from a first user device executing a first instance of a messaging application, a first message for a user of a second user device executing a second instance of the messaging application. The method also includes determining whether the first message includes a first reference to a first media item. The method includes responsive to determining that the first message includes the first reference to the first media item, generating media playlist information identifying the first media item. The method further includes sending the media playlist information identifying the first media item to a content sharing platform, the first media item to be added to a media playlist maintained by the content sharing platform.
Abstract:
In some implementations, a language proficiency of a user of a client device is determined by one or more computers. The one or more computers then determines a text segment for output by a text-to-speech module based on the determined language proficiency of the user. After determining the text segment for output, the one or more computers generates audio data including a synthesized utterance of the text segment. The audio data including the synthesized utterance of the text segment is then provided to the client device for output.
Abstract:
This disclosure relates to systems and methods for proactively determining identification information for a plurality of audio segments within a plurality of broadcast media streams, and providing identification information associated with specific audio portions of a broadcast media stream automatically or upon request.
Abstract:
Methods, systems, and apparatus for receiving a query image and a user tap location, processing the received query image based on the user tap location, identifying one or more entities associated with the processed query image and in response to receiving (i) the query image, and (ii) the user tap location, providing information about the identified one or more of the entities.