Invention Grant
- Patent Title: Web document keyword and phrase extraction
- Patent Title (中): Web文档关键字和短语提取
-
Application No.: US11619230Application Date: 2007-01-03
-
Publication No.: US08135728B2Publication Date: 2012-03-13
- Inventor: Wen-tau Yih , Joshua T. Goodman , Vitor Rocha de Carvalho
- Applicant: Wen-tau Yih , Joshua T. Goodman , Vitor Rocha de Carvalho
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Lee & Hayes, PLLC
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F13/14

Abstract:
Extraction analysis techniques biased, in part, by query frequency information from a query log file and/or search engine cache are employed along with machine learning processes to determine candidate keywords and/or phrases of web documents. Web oriented features associated with the candidate keywords and/or phrases are also utilized to analyze the web documents. A keyword and/or phrase extraction mechanism can be utilized to score keywords and/or phrases in a web document and estimate a likelihood that the keywords and/or phrases are relevant, for example, in an advertising system and the like.
Public/Granted literature
- US20070112764A1 WEB DOCUMENT KEYWORD AND PHRASE EXTRACTION Public/Granted day:2007-05-17
Information query