Invention Grant
- Patent Title: Scheduling resource crawls
- Patent Title (中): 调度资源爬网
-
Application No.: US13011426Application Date: 2011-01-21
-
Publication No.: US08868541B2Publication Date: 2014-10-21
- Inventor: Zhen Lin , Keith Stevens
- Applicant: Zhen Lin , Keith Stevens
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for scheduling resource crawls. In one aspect, a framework is provided for scheduling resource crawls such that a crawl scheduler determines the health of a document, i.e., whether it can be crawled, the popularity of the document, and the frequency of “interesting,” i.e., substantive, content changes, and based on this information, estimates an appropriate crawl interval for each web resource to improve crawl resource utilization.
Public/Granted literature
- US20130144858A1 SCHEDULING RESOURCE CRAWLS Public/Granted day:2013-06-06
Information query