Invention Grant
- Patent Title: Methods, systems, and media for language identification of a media content item based on comments
-
Application No.: US15174668Application Date: 2016-06-06
-
Publication No.: US10430835B2Publication Date: 2019-10-01
- Inventor: Ayşe Seza Doğruöz , Natalia Ponomareva , Christoph Urs Oehler , Dimitri Kanevsky
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Byrne Poh LLP
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06Q30/02 ; G06F17/27 ; G06Q50/00

Abstract:
Methods, systems, and media for language identification of a media content item based on comments are provided. In some embodiments, the method includes: obtaining a plurality of comments associated with a media content item; selecting a subset of the plurality of comments based on one or more criteria; assigning, for each comment in the subset of the plurality of comments, a vector of language probabilities, wherein each component of the vector is assigned a language probability that indicates the likelihood that the comment includes content in a language from a plurality of languages; combining the vector of language probabilities for each comment in the subset of the plurality of comments to generate a combined language vector; identifying a language associated with the media content item based on the combined language vector; and performing an action based on the identified language.
Public/Granted literature
- US20170300976A1 METHODS, SYSTEMS, AND MEDIA FOR LANGUAGE IDENTIFICATION OF A MEDIA CONTENT ITEM BASED ON COMMENTS Public/Granted day:2017-10-19
Information query