Invention Grant
US07962523B2 System and method for detecting templates of a website using hyperlink analysis
有权
使用超链接分析检测网站模板的系统和方法
- Patent Title: System and method for detecting templates of a website using hyperlink analysis
- Patent Title (中): 使用超链接分析检测网站模板的系统和方法
-
Application No.: US12101293Application Date: 2008-04-11
-
Publication No.: US07962523B2Publication Date: 2011-06-14
- Inventor: Krishna Leela Poola
- Applicant: Krishna Leela Poola
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Ostrow Kaufman LLP
- Agent Seth H. Ostrow
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30 ; G06F13/14

Abstract:
The present invention relates to methods, systems, and computer readable media comprising instructions for detecting templates within one or more web pages comprising a website. The method of the present invention comprises generating one or more groups of hyperlinks within a respective web page of the one or more web pages comprising the website. An in-link score is calculated for a given uniform resource locator associated with the one or more web pages comprising the website. The hyperlink groups in which the uniform resource locators associated with the one or more web pages comprising the website appear are identified. A template score is assigned to the identified hyperlinks groups on the basis of the in-link score associated with the uniform resource locators to which the hyperlinks comprising the hyperlink group correspond. The hyperlink groups with template scores exceeding a given template score threshold are thereafter identified as templates.
Public/Granted literature
- US20090259649A1 SYSTEM AND METHOD FOR DETECTING TEMPLATES OF A WEBSITE USING HYPERLINK ANALYSIS Public/Granted day:2009-10-15
Information query