Invention Grant
US09542388B2 Identifying unchecked criteria in unstructured and semi-structured data
Active
- Patent Title: Identifying unchecked criteria in unstructured and semi-structured data
- Application No.: US14861193
- Application Date: 2015-09-22
- Publication No.: US09542388B2
- Publication Date: 2017-01-10
- Inventor: Scott R. Carrier, Elena Romanova, Marie L. Setnes
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Terrile, Cannatti, Chambers & Holland, LLP
- Agent: Stephen A. Terrile
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/28

Abstract:
A method, system and computer-usable medium are disclosed for identifying unchecked criteria in unstructured and semi-structured data within a form. Text spans representing unchecked criteria within unstructured text in a form are detected and classified to facilitate accurate interpretation of the text. Section identification and annotation operations are then performed to identify and categorize sections within the form. Checklist sections within the form, along with associated checkmarks and boxes, are then identified, followed by the identification of checked items, criteria scope, and previously undetected checklist sections. Once all checklist sections and checked criteria have been identified, remaining text spans within a checklist section are annotated as unchecked criteria.
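The final step of the abstract (annotating whatever remains in a checklist section as unchecked) can be illustrated with a minimal Python sketch. It is not the patented method: the marker conventions (`[x]`, `[ ]`, `☑`, `☐`, `☒`), the blank-line section splitting, the treatment of lines ending in a colon as headings, and the names `split_sections` and `annotate` are all assumptions made for illustration.

```python
import re

# Assumed marker conventions; the abstract does not specify any.
CHECKED = re.compile(r"^\s*(\[[xX]\]|\([xX]\)|☑|☒)\s*")
BOX = re.compile(r"^\s*(\[[ xX]?\]|\([ xX]?\)|☐|☑|☒)\s*")


def split_sections(text):
    """Split form text into sections on blank lines (a simplifying assumption)."""
    sections, current = [], []
    for line in text.splitlines():
        if line.strip():
            current.append(line)
        elif current:
            sections.append(current)
            current = []
    if current:
        sections.append(current)
    return sections


def annotate(text):
    """Return (label, span) pairs for the items in each checklist section."""
    annotations = []
    for section in split_sections(text):
        # A section counts as a checklist if any line starts with a box marker.
        if not any(BOX.match(line) for line in section):
            continue
        for line in section:
            is_heading = not BOX.match(line) and line.rstrip().endswith(":")
            if is_heading:
                continue  # assumed heading convention, e.g. "Symptoms:"
            # Checked items are found first; every remaining span in the
            # checklist section is then labeled as unchecked.
            label = "checked" if CHECKED.match(line) else "unchecked"
            annotations.append((label, BOX.sub("", line).strip()))
    return annotations


form = "Patient notes\nSeen on Monday.\n\nSymptoms:\n[x] Fever\n[ ] Cough\nHeadache\n"
result = annotate(form)
```

In this sketch the free-text section is skipped because it contains no box marker, while "Headache" is labeled unchecked even though it carries no box, since it is a leftover span inside a checklist section.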
Public/Granted literature
- US20160012041A1 Identifying Unchecked Criteria in Unstructured and Semi-Structured Data (Public/Granted day: 2016-01-14)