Distributed system for large volume deep web data extraction

Invention Grant

US10210255B2 Distributed system for large volume deep web data extraction 有权

Please log in to see more content

Patent Title: Distributed system for large volume deep web data extraction
Application No.: US14986536

Application Date: 2015-12-31
Publication No.: US10210255B2

Publication Date: 2019-02-19
Inventor: Jason Crabtree , Andrew Sellers
Applicant: Fractal Industries, Inc.
Applicant Address: US VA Reston
Assignee: Fractal Industries, Inc.
Current Assignee: Fractal Industries, Inc.
Current Assignee Address: US VA Reston
Agency: Galvin Patent Law LLC
Agent Brian R. Galvin
Main IPC: G06F17/30
IPC: G06F17/30 ; G06Q10/06 ; H04L29/08

Abstract:

A distributed system for large volume deep web data extraction that is extremely scalable, allows multiple heterogeneous concurrent searches, has power web scrape result processing capabilities and uses a well defined, highly customizable, simplified, search agent configuration interface requiring minimal specialized programming knowledge. A scrape campaign control module receives scrape control and web spider configuration parameters through either a command line interface of an HTTP based application programming interface. The control module uses those parameters to have an arbitrary plurality of web spiders created and deployed by a plurality of servers. Scrape campaign results are presented as prescribed.

Public/Granted literature

US20170193110A1 DISTRIBUTED SYSTEM FOR LARGE VOLUME DEEP WEB DATA EXTRACTION Public/Granted day:2017-07-06

Information query

Espacenet