Invention Grant
- Patent Title: Building and maintaining information extraction rules
-
Application No.: US15253613Application Date: 2016-08-31
-
Publication No.: US10296573B2Publication Date: 2019-05-21
- Inventor: Arnaldo Carreno-Fuentes , Laura Chiticariu , Eser Kandogan , Yunyao Li , Huahai Yang
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates LLC
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F17/24 ; G06F16/93 ; G06F17/21 ; G06F3/0482 ; G06F3/0486 ; G06F17/22

Abstract:
Methods and arrangements for managing development of information extraction rules. One or more documents are opened for extraction. An interface is provided to create a label and thereupon label a portion of the document. The created label is stored, and an extractor is developed based on the labeling. A test interface is provided for the extractor, and results of a test conducted through the test interface are displayed. The extractor is exported. In accordance with at least one embodiment, developers are presented with eased automated guidance to write extractors, which thereby reduces an overall manual effort involved in extractor development. Generally, a focused, tutorial-type environment serves as a guide based on previously developed best practices.
Public/Granted literature
- US20160371243A1 BUILDING AND MAINTAINING INFORMATION EXTRACTION RULES Public/Granted day:2016-12-22
Information query