Invention Grant
- Patent Title: Matching text to images
-
Application No.: US12979375Application Date: 2010-12-28
-
Publication No.: US08503769B2Publication Date: 2013-08-06
- Inventor: Simon Baker , Dahua Lin , Anitha Kannan , Qifa Ke
- Applicant: Simon Baker , Dahua Lin , Anitha Kannan , Qifa Ke
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06F17/00

Abstract:
Text in web pages or other text documents may be classified based on the images or other objects within the webpage. A system for identifying and classifying text related to an object may identify one or more web pages containing the image or similar images, determine topics from the text of the document, and develop a set of training phrases for a classifier. The classifier may be trained and then used to analyze the text in the documents. The training set may include both positive examples and negative examples of text taken from the set of documents. A positive example may include captions or other elements directly associated with the object, while negative examples may include text taken from the documents, but from a large distance from the object. In some cases, the system may iterate on the classification process to refine the results.
Public/Granted literature
- US20120163707A1 MATCHING TEXT TO IMAGES Public/Granted day:2012-06-28
Information query