Invention Grant
- Patent Title: Query-directed discovery and alignment of collections of document passages for improving named entity disambiguation precision
-
Application No.: US16278805Application Date: 2019-02-19
-
Publication No.: US10936819B2Publication Date: 2021-03-02
- Inventor: Charles E. Beller , Christopher F. Ackermann , Michael Drzewucki , Andrew Doyle , Edward G. Katz , Kristen M. Summers
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agent Feb R. Cabrasawan; Amy J. Pattillo
- Main IPC: G06F40/295
- IPC: G06F40/295 ; G06F16/242 ; G06N5/04 ; G06F16/93 ; G06F16/2457

Abstract:
A query system identifies a collection of discovered entity bins each comprising unstructured documents with mentions of a name element from a name query and each identified with a particular named entity identifiable from the name element. The query system identifies, from a knowledge base of structured documents, based on identifier components with the name element, candidate records identifying the respective identifier components with the name element, the one or more identifier components identified among the discovery entity bins. For each respective selection of candidate records associated with each bin, the query system applies one or more alignment threshold rules to rank the likelihood that each candidate record within each respective selection matches one or more characteristics of the respective discovery entity bin. The query system aligns, with each of the discovery entity bins, a highest ranked record from among each respective selection of candidate records, where the respective aligned highest ranked record identifies a distinct named entity from among the named entities.
Public/Granted literature
Information query