Invention Grant
- Patent Title: Method for deducing entity relationships across corpora using cluster based dictionary vocabulary lexicon
-
Application No.: US15597677Application Date: 2017-05-17
-
Publication No.: US10664505B2Publication Date: 2020-05-26
- Inventor: Donna K. Byron , Swaminathan Chandrasekaran , Lakshminarayanan Krishnamurthy
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Terrile, Cannatti & Chambers, LLP
- Agent Michael Rocco Cannatti
- Main IPC: G06F16/30
- IPC: G06F16/30 ; G06F16/33 ; G06F16/245 ; G06F16/35 ; G06F40/30

Abstract:
An approach is provided for identifying entity relationships based on word classifications extracted from business documents stored in a plurality of corpora. In the approach, performed by an information handling system, a plurality of cluster classifications are identified for the business documents so that entity information from the business documents can be classified or assigned to the cluster classifications, such as by performing natural language processing (NLP) analysis of the business documents. The approach applies semantic analysis to identify and score entity relationships between the entity information classified in the cluster classifications, and based on the scored entity relationships, cluster relationships between the cluster classifications are identified.
Public/Granted literature
Information query