Invention Grant
- Patent Title: Distributed system for large volume deep web data extraction
-
Application No.: US14986536Application Date: 2015-12-31
-
Publication No.: US10210255B2Publication Date: 2019-02-19
- Inventor: Jason Crabtree , Andrew Sellers
- Applicant: Fractal Industries, Inc.
- Applicant Address: US VA Reston
- Assignee: Fractal Industries, Inc.
- Current Assignee: Fractal Industries, Inc.
- Current Assignee Address: US VA Reston
- Agency: Galvin Patent Law LLC
- Agent Brian R. Galvin
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06Q10/06 ; H04L29/08

Abstract:
A distributed system for large volume deep web data extraction that is extremely scalable, allows multiple heterogeneous concurrent searches, has power web scrape result processing capabilities and uses a well defined, highly customizable, simplified, search agent configuration interface requiring minimal specialized programming knowledge. A scrape campaign control module receives scrape control and web spider configuration parameters through either a command line interface of an HTTP based application programming interface. The control module uses those parameters to have an arbitrary plurality of web spiders created and deployed by a plurality of servers. Scrape campaign results are presented as prescribed.
Public/Granted literature
- US20170193110A1 DISTRIBUTED SYSTEM FOR LARGE VOLUME DEEP WEB DATA EXTRACTION Public/Granted day:2017-07-06
Information query