Invention Grant
- Patent Title: Scheduler for search engine crawler
- Patent Title (中): 搜索引擎抓取器的计划程序
-
Application No.: US13449228Application Date: 2012-04-17
-
Publication No.: US08775403B2Publication Date: 2014-07-08
- Inventor: Keith H. Randall
- Applicant: Keith H. Randall
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A scheduler for a search engine crawler includes a history log containing document identifiers (e.g., URLs) corresponding to documents (e.g., web pages) on a network (e.g., Internet). The scheduler is configured to process each document identifier in a set of the document identifiers by determining a content change frequency of the document corresponding to the document identifier, determining a first score for the document identifier that is a function of the determined content change frequency of the corresponding document, comparing the first score against a threshold value, and scheduling the corresponding document for indexing based on the results of the comparison.
Public/Granted literature
- US20120317089A1 Scheduler for Search Engine Crawler Public/Granted day:2012-12-13
Information query