Invention Grant
- Patent Title: Configuring web crawler to extract web page information
- Patent Title (中): 配置网页抓取工具来提取网页信息
- Application No.: US14081105
- Application Date: 2013-11-15
- Publication No.: US09015144B2
- Publication Date: 2015-04-21
- Inventor: Yiming Sun, Qi Qiang, Boyang Cai, Xiaojun Jin, Zongyuan Wu
- Applicant: Alibaba Group Holding Limited
- Applicant Address: KY George Town, Grand Cayman
- Assignee: Alibaba Group Holding Limited
- Current Assignee: Alibaba Group Holding Limited
- Current Assignee Address: KY George Town, Grand Cayman
- Agency: Van Pelt, Yi & James LLP
- Priority: CN201110207897 20110722
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F7/00; G06Q30/02; G06Q10/10

Abstract:
Web crawling configuration includes: obtaining a webpage comprising a plurality of nodes; receiving a user selection of a node in the webpage; presenting a set of web crawling configuration options pertaining to a web crawling action to be performed with respect to the node, the set of web crawling configuration options depending at least in part on a type of an element included in the node and comprising: a first option to perform a first web crawling action in the event that the node includes a first type of the element; and a second option to perform a second web crawling action in the event that the node includes a second type of the element; receiving a user input specifying the web crawling configuration option; and storing the user-specified web crawling configuration option, performing the web crawling action on the node according to the user input, or both.
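
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the element types (`a`, `img`), the action names, the `Node` class, and the functions `present_options` and `configure` are all hypothetical, chosen only to show how the option set might depend on the type of element in the selected node.

```python
from dataclasses import dataclass

# Hypothetical mapping from element type to crawl actions. The abstract only
# states that the options depend on the element type; these specific types
# and action names are assumptions made for illustration.
OPTIONS_BY_ELEMENT = {
    "a": ["extract_link_text", "follow_link"],
    "img": ["download_image", "extract_alt_text"],
}
DEFAULT_OPTIONS = ["extract_text"]


@dataclass
class Node:
    path: str      # location of the node in the page, e.g. an XPath-like string
    tag: str       # type of the element included in the node
    content: str   # text or attribute value carried by the node


def present_options(node):
    """Return the configuration options offered for the selected node."""
    return OPTIONS_BY_ELEMENT.get(node.tag, DEFAULT_OPTIONS)


def configure(node, choice, store):
    """Validate the user's choice against the offered options and store it."""
    if choice not in present_options(node):
        raise ValueError(f"{choice!r} is not offered for <{node.tag}> nodes")
    store[node.path] = choice
    return choice


# Example: the user selects a link node and chooses to follow it.
config = {}
link = Node(path="/html/body/a[1]", tag="a", content="Next page")
configure(link, "follow_link", config)
```

Keying the stored configuration by node path is one possible design; it lets a later crawl look up the chosen action for the same position in structurally similar pages.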
Public/Granted literature
- US20140129541A1 CONFIGURING WEB CRAWLER TO EXTRACT WEB PAGE INFORMATION, Public/Granted day: 2014-05-08