Invention Grant
US09576052B2 Systems and methods of web crawling 有权
网络抓取的系统和方法

Systems and methods of web crawling
Abstract:
Methods and systems for dynamically training a web crawler. The web crawler maintains one or more categories each comprising a set of words. The method includes selecting at least one hyperlink in response to a query received from a user. The method further includes determining a hyperlink score for the at least one hyperlink based on a category score associated with each of one or more categories. The category score associated with each of the one or more categories is updated based at least in part on the hyperlink score. The updated category score is compared with the hyperlink score to select a category from the one or more categories. The set of words associated with the category is updated based on content of a web page pointed by the at least one hyperlink.
Public/Granted literature
Information query
Patent Agency Ranking
0/0