Invention Grant
- Patent Title: Merging synonymous entities from multiple structured sources into a dataset
-
Application No.: US15273959Application Date: 2016-09-23
-
Publication No.: US10671577B2Publication Date: 2020-06-02
- Inventor: Shilpi Ahuja , Sheng Hua Bao , Rashmi Gangadharaiah
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: SVL IP Law Edell, Shapiro & Finnan, LLC
- Agent Will Stock
- Main IPC: G06F16/215
- IPC: G06F16/215

Abstract:
Merging synonymous entities from multiple structured sources into a dataset includes receiving a first set of paired terms from a first authoritative source for a domain and a second set of paired terms from a second authoritative source for the domain. The first set of paired terms is compared to the second set of paired terms with a similarity assessment based on a clustering statistical algorithm to identify paired terms from the first set of paired terms that share a synonymous term with one or more paired terms from the second set of paired terms. The paired terms associated with the synonymous term are merged and a dataset is generated that associates a normalized version of the synonymous term with any terms included in the merged paired terms.
Public/Granted literature
- US20180089300A1 MERGING SYNONYMOUS ENTITIES FROM MULTIPLE STRUCTURED SOURCES INTO A DATASET Public/Granted day:2018-03-29
Information query