Invention Grant
US07949661B2 System and method for identifying web communities from seed sets of web pages
Status: In force
- Patent Title: System and method for identifying web communities from seed sets of web pages
- Application No.: US11510412
- Application Date: 2006-08-24
- Publication No.: US07949661B2
- Publication Date: 2011-05-24
- Inventor: Reid Marlow Andersen, Kevin John Lang
- Applicant: Reid Marlow Andersen, Kevin John Lang
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Baker Botts L.L.P.
- Main IPC: G06F17/00
- IPC: G06F17/00

Abstract:
An improved system and method are provided for identifying web communities from seed sets of web pages. A seed set of web pages may be represented as a set of seed vertices in a graph representing a collection of web pages. An initial probability distribution may be constructed on the vertices of the graph by assigning a nonzero value to each vertex belonging to the seed set. A sequence of probability distributions may then be produced by repeatedly updating the current distribution with a one-step walk over the vertices of the graph. For each probability distribution in the sequence, level sets of vertices may be generated, and the level set with minimal conductance may be selected. The level set with the least conductance across the whole sequence may then be output as representing a community of web pages.
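The steps in the abstract can be sketched in Python as follows. This is a minimal illustration, not the patented implementation, and it rests on several assumptions the abstract does not state: the graph is undirected and given as an adjacency dict, the seed mass is spread uniformly, each one-step walk is a lazy random walk (half the mass stays put), and level sets are formed by ordering vertices by probability divided by degree. The function names `lazy_walk_step`, `best_level_set`, and `find_community` are hypothetical.

```python
from collections import defaultdict


def lazy_walk_step(adj, dist):
    """One lazy random-walk step (assumed): keep half the mass, spread half to neighbors."""
    nxt = defaultdict(float)
    for u, p in dist.items():
        nxt[u] += p / 2
        share = p / (2 * len(adj[u]))
        for v in adj[u]:
            nxt[v] += share
    return dict(nxt)


def best_level_set(adj, dist, total_volume):
    """Sweep the level sets of one distribution and return (conductance, set) for the best."""
    # Assumed ordering: probability normalized by degree, highest first.
    # Every vertex carrying mass must have at least one neighbor.
    order = sorted(dist, key=lambda v: dist[v] / len(adj[v]), reverse=True)
    members = set()
    volume = 0  # sum of degrees inside the current level set
    cut = 0     # number of edges leaving the current level set
    best = (float("inf"), frozenset())
    for v in order:
        inside = sum(1 for w in adj[v] if w in members)
        # Edges from v to members stop being cut; v's other edges become cut.
        cut += len(adj[v]) - 2 * inside
        volume += len(adj[v])
        members.add(v)
        denom = min(volume, total_volume - volume)
        if denom <= 0:
            break  # the level set now covers the whole graph
        phi = cut / denom
        if phi < best[0]:
            best = (phi, frozenset(members))
    return best


def find_community(adj, seeds, steps=10):
    """Return (conductance, vertex set) of the best level set seen over all walk steps."""
    total_volume = sum(len(neighbors) for neighbors in adj.values())
    # Initial distribution: nonzero (here uniform, by assumption) on the seed vertices.
    dist = {s: 1.0 / len(seeds) for s in seeds}
    best = (float("inf"), frozenset())
    for _ in range(steps):
        dist = lazy_walk_step(adj, dist)
        candidate = best_level_set(adj, dist, total_volume)
        if candidate[0] < best[0]:
            best = candidate
    return best


# Toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by the edge 2-3.
toy_graph = {
    0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
    3: [2, 4, 5], 4: [3, 5], 5: [3, 4],
}
phi, community = find_community(toy_graph, seeds=[0], steps=5)
```

On this toy graph, seeding at vertex 0 recovers the triangle {0, 1, 2}, whose single outgoing edge gives it a conductance of 1/7. The cut and volume are updated incrementally during the sweep, so each distribution is scanned in time proportional to the degrees of the vertices that carry probability mass.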
Public/Granted literature
- US20080052263A1 System and method for identifying web communities from seed sets of web pages (Public/Granted day: 2008-02-28)