Invention Grant
- Patent Title: Semantic duplicate normalization and standardization
-
Application No.: US17513188Application Date: 2021-10-28
-
Publication No.: US12050873B2Publication Date: 2024-07-30
- Inventor: Hans-Martin Ramsl
- Applicant: SAP SE
- Applicant Address: DE Walldorf
- Assignee: SAP SE
- Current Assignee: SAP SE
- Current Assignee Address: DE Walldorf
- Agency: SCHWEGMAN LUNDBERG & WOESSNER, P.A.
- Main IPC: G06F40/30
- IPC: G06F40/30 ; G06F16/33 ; G06F16/36 ; G06F40/247 ; G06N20/00

Abstract:
Systems, methods, and computer-readable media are disclosed for list attribute normalization and standardization for creation of a controlled vocabulary. A vocabulary set comprising a plurality of vocabulary term may be received. For each vocabulary term, semantic duplicates may be identified. The semantic duplicates may be identified by analyzing semantics, syntactics, or phonetics of the vocabulary terms. Semantic chains may be formed from each vocabulary term and the corresponding semantic duplicates. The terms in each semantic chain may be ranked to determine a most probable vocabulary term. The most probable vocabulary term may then replace the semantic chain. The most probable vocabulary term across all semantic chains from the vocabulary set may form the controlled vocabulary.
Public/Granted literature
- US20230139644A1 SEMANTIC DUPLICATE NORMALIZATION AND STANDARDIZATION Public/Granted day:2023-05-04
Information query