Invention Grant
US07941421B2 Unsupervised detection of web pages corresponding to a similarity class
Status: Active
- Patent Title: Unsupervised detection of web pages corresponding to a similarity class
- Application No.: US12716195
- Application Date: 2010-03-02
- Publication No.: US07941421B2
- Publication Date: 2011-05-10
- Inventor: Mahesh Tiyyagura
- Applicant: Mahesh Tiyyagura
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Weaver Austin Villeneuve and Sampson LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A method of detecting web pages belonging to at least one similarity class from a plurality of web pages includes determining clusters of the plurality of web pages based on characteristics of the content of the web pages. For each of the determined clusters, at least one metric is determined indicative of similarity among resource locators associated with the web pages of that cluster. A determination of web pages belonging to the at least one similarity class is based on the determined clusters and the determined similarity metrics.
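The three steps in the abstract (cluster by content, score each cluster's resource locators, then decide) can be illustrated with a minimal Python sketch. This is not the patented implementation: the abstract does not say which content characteristics, clustering algorithm, or URL metric are used. The token-set Jaccard clustering, the digit-masked URL pattern, the thresholds, and every function name below are assumptions chosen for illustration only.

```python
import re
from collections import Counter
from urllib.parse import urlparse


def jaccard(a, b):
    """Jaccard similarity of two sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster_by_content(pages, threshold=0.5):
    """Greedy single-pass clustering (assumed, not from the patent).

    A page joins the first cluster whose seed page has a token-set
    Jaccard similarity of at least `threshold`; otherwise it starts
    a new cluster. `pages` is an iterable of (url, text) pairs.
    """
    clusters = []  # each cluster is a list of (url, tokens); item 0 is the seed
    for url, text in pages:
        tokens = set(re.findall(r"\w+", text.lower()))
        for cluster in clusters:
            if jaccard(tokens, cluster[0][1]) >= threshold:
                cluster.append((url, tokens))
                break
        else:
            clusters.append([(url, tokens)])
    return [[url for url, _ in cluster] for cluster in clusters]


def url_pattern(url):
    """Reduce a URL to a coarse pattern: host plus path with digit runs masked."""
    parts = urlparse(url)
    return parts.netloc + re.sub(r"\d+", "N", parts.path)


def url_similarity(urls):
    """Hypothetical metric: share of URLs matching the cluster's most common pattern."""
    counts = Counter(url_pattern(u) for u in urls)
    return counts.most_common(1)[0][1] / len(urls)


def detect_similarity_class(pages, content_threshold=0.5,
                            url_threshold=0.8, min_size=2):
    """Return the content clusters whose URLs are also mutually similar."""
    detected = []
    for urls in cluster_by_content(pages, content_threshold):
        if len(urls) >= min_size and url_similarity(urls) >= url_threshold:
            detected.append(urls)
    return detected


if __name__ == "__main__":
    sample = [
        ("http://example.com/item/1", "buy cheap widget now free shipping"),
        ("http://example.com/item/2", "buy cheap gadget now free shipping"),
        ("http://example.org/about", "our company history and team"),
    ]
    print(detect_similarity_class(sample))
```

No labeled examples are needed at any step, which is the sense in which such a pipeline is unsupervised; the thresholds are tuning parameters rather than learned values.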
Public/Granted literature
- US20100161588A1 UNSUPERVISED DETECTION OF WEB PAGES CORRESPONDING TO A SIMILARITY CLASS (Public/Granted day: 2010-06-24)