Invention Grant
- Patent Title: Pre-processing a table in a document for natural language processing
-
Application No.: US18154665Application Date: 2023-01-13
-
Publication No.: US11869264B2Publication Date: 2024-01-09
- Inventor: Scott Carrier , Ritwik Ray , Jonathan Chapin Rand , Jothilakshmi Sirangimoorthy , Hui Wang , Robert Fredenburg
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: KONRAD RAYNES DAVDA & VICTOR LLP
- Agent David W. Victor
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06V30/412 ; G06F3/0482 ; G06F40/237 ; G06F40/40 ; G06V30/416

Abstract:
Provided are a computer program product, system, and method for pre-processing a table in a document for natural language processing. A table in a document is parsed to extract column headers, row headers, and data cells, which are processed to determine an initial set of a main element comprising an entity whose value is to be extracted, a conditional element that refines the entity, and a value element comprising a value for the entity. A user selection is received of at least one of the column headers, row headers, and data cells for at least one of the main element, conditional element, and the value element in the initial set to produce a modified set of the main element, conditional element, and value element. The modified set is provided to a natural language processing engine to perform natural language processing of the document including the table, using the modified set.
Public/Granted literature
- US20230154220A1 PRE-PROCESSING A TABLE IN A DOCUMENT FOR NATURAL LANGUAGE PROCESSING Public/Granted day:2023-05-18
Information query