Invention Grant
- Patent Title: Descriptor uniqueness for entity clustering
-
Application No.: US16792456Application Date: 2020-02-17
-
Publication No.: US11544312B2Publication Date: 2023-01-03
- Inventor: Donna K. Byron , Edward Graham Katz , Christopher F. Ackermann , Charles E. Beller
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Stephen J. Walder, Jr.; Brian Welle
- Main IPC: G06F16/35
- IPC: G06F16/35 ; G06F16/33 ; G06F16/332 ; G06K9/62 ; G06F40/169 ; G06N7/00 ; G06F40/295 ; G06F40/216

Abstract:
A mechanism is provided in a data processing system to implement a cognitive natural language processing (NLP) system with descriptor uniqueness identification to support named entity mention clustering. The mechanism annotates a set of documents from a corpus of documents for entity types and mentions, collects descriptor usages from all documents in the corpus of documents, analyzes the descriptor usages to classify the descriptors as base terms or modifier terms, generates compatibility scores for the descriptors, and performs entity merging of entity clusters based on the compatibility scores.
Public/Granted literature
- US20210256049A1 Descriptor Uniqueness for Entity Clustering Public/Granted day:2021-08-19
Information query