Invention Grant
- Patent Title: Limiting requests by web crawlers to a web host
- Patent Title (中): 将网页抓取工具的请求限制在Web主机上
-
Application No.: US10742398Application Date: 2003-12-18
-
Publication No.: US07774782B1Publication Date: 2010-08-10
- Inventor: Catalin T. Popescu , Anurag Acharya
- Applicant: Catalin T. Popescu , Anurag Acharya
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06F9/46
- IPC: G06F9/46 ; G06F15/16

Abstract:
A host load server balances a web host's load capacity among multiple competing web crawlers of a search engine. The host load server establishes a lease for each pair of requesting web crawler and requested web host. The lease expires at a scheduled time. If the web crawler completes its mission of retrieving documents from the web host prior to the expiration of the lease, the host load server releases the load capacity allocated to the web crawler and makes it available for other competing web crawlers. If the web crawler submits a request for renewing its lease with the web host at the scheduled time, the host load server allocates another share of load capacity to the web crawler. If the web crawler does not submit any request at the scheduled time, the host load server terminates the lease and releases the load capacity for other web crawlers.
Information query