Invention Grant
- Patent Title: Interactive web crawler
- Patent Title (中): 互动式网页抓取工具
-
Application No.: US14965570Application Date: 2015-12-10
-
Publication No.: US09524343B2Publication Date: 2016-12-20
- Inventor: Chao Liu , Chao Zhou , Yi-Min Wang
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agent Alin Corie; Sandy Swain; Micky Minhas
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F3/0482

Abstract:
The claimed subject matter provides a system or method for web crawling hidden files. An example method includes loading a web page with a browser agent, and executing any dynamic elements hosted on the web page using the browser agent to insert pre-determined values. A list of form controls may be retrieved from the web page using the browser agent, and the controls may be analyzed using a driver component. Form control values may be sent from the driver component to the browser agent, and an event may be submitted to the web page by the browser agent or scripted content may be run to trigger operations on the web page corresponding to the form control values. A URL may be generated for various form control values using a generalizer.
Public/Granted literature
- US20160110456A1 INTERACTIVE WEB CRAWLER Public/Granted day:2016-04-21
Information query