Invention Grant
- Patent Title: Determining metadata of a dataset
-
Application No.: US16941565Application Date: 2020-07-29
-
Publication No.: US11550777B2Publication Date: 2023-01-10
- Inventor: Thomas Gschwind , Christoph Adrian Miksovic Czasch , Paolo Scotton
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Grant M. McNeilly
- Main IPC: G06F16/20
- IPC: G06F16/20 ; G06F16/23 ; G06F16/2455

Abstract:
The present disclosure relates to a method for enabling a processing of a dataset of records having a set of attributes. The method comprises: selecting a first attribute of the set of attributes and a subset of one or more second attributes of the set of attributes. Distinct values of the subset of second attributes may be determined from the dataset. For each distinct value of the determined distinct values records of the dataset that have said each distinct value may be identified, and a group of words may be formed from values of the first attribute of the identified records. Distinct word sequences may be identified in the formed groups and a level of presence of each word sequence of the word sequences in each of the formed groups may be determined. At least part of the levels of presence may be provided as metadata.
Public/Granted literature
- US20220035792A1 DETERMINING METADATA OF A DATASET Public/Granted day:2022-02-03
Information query