Systems and/or methods for machine-learning based data correction and completion in sparse datasets

Invention Grant

US11875234B2 Systems and/or methods for machine-learning based data correction and completion in sparse datasets (in force)

Patent Title: Systems and/or methods for machine-learning based data correction and completion in sparse datasets
Application No.: US16932066

Application Date: 2020-07-17
Publication No.: US11875234B2

Publication Date: 2024-01-16
Inventor: Vijay Anand Chidambaram, Ulrich Kalex
Applicant: Software AG
Applicant Address: DE Darmstadt
Assignee: SOFTWARE AG
Current Assignee: SOFTWARE AG
Current Assignee Address: DE Darmstadt
Agency: Nixon & Vanderhye P.C.
Priority: IN 2011020330, 2020-05-14
Main IPC: G06F7/00
IPC: G06F7/00; G06N20/00; G06Q10/06; G06F18/23213; G06F16/28

Abstract:

Certain example embodiments herein relate to techniques for automatically correcting and completing data in sparse datasets. Records in the dataset are divided into groups with properties having similar values. For each group, one or more properties of the records therein that is/are to be ignored is/are identified, based on record distances relative to the records in the group, and distances among values for each of the properties of the records in the respective group. The records in the groups are further divided into sub-groups without regard to the one or more properties that is/are to be ignored. The sub-groups include a smaller and more cohesive set of records. For each sub-group: based on the records therein, predicted values to be applied to values identified as being empty but needing to be filled in are determined; and those predicted values are applied. The corrected/completed dataset is provided as output.
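
The flow described in the abstract (group, find properties to ignore, sub-group, predict, apply) can be illustrated with a minimal Python sketch. This is not the patented method: the mismatch-fraction distance, the greedy "leader" grouping, the spread threshold, the most-frequent-value prediction, and all names and sample records below are assumptions chosen only to keep the example short and self-contained.

```python
from collections import Counter

def distance(a, b, props):
    """Fraction of compared properties that differ; missing values are skipped."""
    shared = [p for p in props if a.get(p) is not None and b.get(p) is not None]
    if not shared:
        return 1.0
    return sum(a[p] != b[p] for p in shared) / len(shared)

def group(records, props, threshold):
    """Greedy grouping (assumed stand-in): join the first group whose leader is close enough."""
    groups = []
    for rec in records:
        for g in groups:
            if distance(rec, g[0], props) <= threshold:
                g.append(rec)
                break
        else:
            groups.append([rec])
    return groups

def noisy_props(grp, props, max_spread):
    """Properties whose known values disagree too much inside the group."""
    ignored = set()
    for p in props:
        values = [r[p] for r in grp if r.get(p) is not None]
        if values:
            spread = 1 - Counter(values).most_common(1)[0][1] / len(values)
            if spread > max_spread:
                ignored.add(p)
    return ignored

def complete(records, props, coarse=0.5, fine=0.0, max_spread=0.4):
    """Return completed copies of the records, in the original order."""
    copies = [dict(r) for r in records]  # leave the caller's data untouched
    for grp in group(copies, props, coarse):
        ignored = noisy_props(grp, props, max_spread)
        kept = [p for p in props if p not in ignored]
        # Sub-group without regard to the ignored properties.
        for sub in group(grp, kept, fine):
            for p in props:
                values = [r[p] for r in sub if r.get(p) is not None]
                if not values:
                    continue  # nothing to predict from
                predicted = Counter(values).most_common(1)[0][0]
                for r in sub:
                    if r.get(p) is None:
                        r[p] = predicted
    return copies

# Hypothetical sparse dataset; None marks an empty value.
props = ["type", "os", "owner", "tier"]
records = [
    {"type": "server", "os": "linux", "owner": "ops", "tier": "prod"},
    {"type": "server", "os": "linux", "owner": "dev", "tier": None},
    {"type": "server", "os": "linux", "owner": "qa", "tier": "prod"},
    {"type": "laptop", "os": "macos", "owner": "dev", "tier": "office"},
    {"type": "laptop", "os": None, "owner": "ops", "tier": "office"},
]
completed = complete(records, props)
```

In this sample, `owner` varies widely inside each group, so it is ignored when forming sub-groups; the remaining properties then agree well enough for the empty `tier` and `os` values to be filled from their sub-group neighbours.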

Public/Granted literature

US20210357706A1 SYSTEMS AND/OR METHODS FOR MACHINE-LEARNING BASED DATA CORRECTION AND COMPLETION IN SPARSE DATASETS Publication date: 2021-11-18

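IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: G06N)
G06F7/00	Methods or arrangements for processing data by operating upon the order or content of the data handled (logic circuits: H03K19/00)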