Invention Grant
US08880588B2 Technique for stateless distributed parallel crawling of interactive client-server applications
有权
交互式客户端 - 服务器应用程序的无状态分布式并行爬行技术
- Patent Title: Technique for stateless distributed parallel crawling of interactive client-server applications
- Patent Title (中): 交互式客户端 - 服务器应用程序的无状态分布式并行爬行技术
-
Application No.: US12957381Application Date: 2010-11-30
-
Publication No.: US08880588B2Publication Date: 2014-11-04
- Inventor: Mukul Ranjan Prasad
- Applicant: Mukul Ranjan Prasad
- Applicant Address: JP Kawasaki-shi
- Assignee: Fujitsu Limited
- Current Assignee: Fujitsu Limited
- Current Assignee Address: JP Kawasaki-shi
- Agency: Baker Botts L.L.P.
- Main IPC: G06F15/16
- IPC: G06F15/16 ; G06F11/36 ; G06F9/50 ; G06Q10/06

Abstract:
A distributed computing system includes worker nodes and a master node including a processor coupled to a memory. Each worker node crawls a portion of an interactive client-server application. The memory includes a master state graph, including the results of crawling. The master node is configured to examine the master state graph to determine a number of reconverging traces, receive a result from a job from a worker node if the number of reconverging traces is below a threshold, and add the result to the master state graph without attempting to remove duplicate states or transitions. A trace includes states and transitions representing valid. A reconvergent trace includes a trace including a reconvergent state, which is a state that can be reached through two or more distinct traces. The result containing states and transitions is associated with crawling a first portion of the interactive client-server application.
Public/Granted literature
- US20120110063A1 TECHNIQUE FOR STATELESS DISTRIBUTED PARALLEL CRAWLING OF INTERACTIVE CLIENT-SERVER APPLICATIONS Public/Granted day:2012-05-03
Information query