Invention Grant
- Patent Title: Configuring web crawler to extract web page information
-
Application No.: US14523731Application Date: 2014-10-24
-
Publication No.: US09330179B2Publication Date: 2016-05-03
- Inventor: Yiming Sun , Qi Qiang , Boyang Cai , Xiaojun Jin , Zongyuan Wu
- Applicant: Alibaba Group Holding Limited
- Applicant Address: KY
- Assignee: Alibaba Group Holding Limited
- Current Assignee: Alibaba Group Holding Limited
- Current Assignee Address: KY
- Agency: Van Pelt, Yi & James LLP
- Priority: CN201110207897 20110722
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F7/00 ; G06Q30/02 ; G06Q10/10

Abstract:
Web crawling configuration includes: obtaining a webpage comprising a plurality of receiving a user selection of a node in the webpage; presenting a set of web crawling configuration options pertaining to a web crawling action to be performed with respect to the node, the set of web crawling configuration options depending at least in part on a type of an element included in the node and comprising: a first option to perform a first web crawling action in the event that the node include a first type of the element; and a second option to perform a second web crawling action in the event that the node includes a second type of the element; receiving a user input specifying the web crawling configuration option; and storing user specified web crawling configuration option, performing the web crawling action on the node according to the user input, or both.
Public/Granted literature
- US20150106357A1 CONFIGURING WEB CRAWLER TO EXTRACT WEB PAGE INFORMATION Public/Granted day:2015-04-16
Information query