Invention Grant
- Patent Title: Data generalization for predictive models
- Application No.: US16532505
- Application Date: 2019-08-06
- Publication No.: US11281728B2
- Publication Date: 2022-03-22
- Inventor: Gilad Ezov, Ariel Farkash, Abigail Goldsteen, Ron Shmelkin, Micha Gideon Moffie
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent: Ziv Glazberg
- Main IPC: G06F16/906
- IPC: G06F16/906; G06K9/62

Abstract:
A method, apparatus and a product for data generalization for predictive models. The method comprising: based on a labeled dataset, determining a plurality of buckets, each of which has an associated label; determining a plurality of clusters, grouping similar instances in the same bucket; based on the plurality of clusters, determining an alternative set of features comprising a set of generalized features, wherein each generalized feature corresponds to a cluster of the plurality of clusters, wherein a generalized feature that corresponds to a cluster is indicative of the instance being mapped to the corresponding cluster; obtaining a second instance; determining a generalized second instance that comprises a valuation of the alternative set of features for the second instance; and based on the generalized second instance, determining a label for the second instance.
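
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how buckets or clusters are computed, so this sketch assumes one bucket per label, a simple greedy distance-threshold clustering inside each bucket, and a one-hot generalized feature per cluster. All function names, the `radius` parameter, and the toy data are hypothetical.

```python
from collections import defaultdict
from math import dist


def make_buckets(dataset):
    """Group labeled instances into buckets (assumption: one bucket per label)."""
    buckets = defaultdict(list)
    for features, label in dataset:
        buckets[label].append(features)
    return dict(buckets)


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def cluster_bucket(instances, radius):
    """Greedy clustering (assumption): an instance joins the first cluster
    whose current centroid is within `radius`, otherwise it starts a new one.
    Returns the centroid of each resulting cluster."""
    clusters = []
    for x in instances:
        for members in clusters:
            if dist(x, centroid(members)) <= radius:
                members.append(x)
                break
        else:
            clusters.append([x])
    return [centroid(members) for members in clusters]


def fit(dataset, radius=1.5):
    """Build the clusters; each cluster inherits the label of its bucket."""
    centroids, labels = [], []
    for label, instances in make_buckets(dataset).items():
        for c in cluster_bucket(instances, radius):
            centroids.append(c)
            labels.append(label)
    return centroids, labels


def generalize(instance, centroids):
    """Alternative feature set: one generalized feature per cluster, set to 1
    for the cluster the instance maps to (nearest centroid) and 0 elsewhere."""
    nearest = min(range(len(centroids)), key=lambda i: dist(instance, centroids[i]))
    return [1 if i == nearest else 0 for i in range(len(centroids))]


def predict(generalized, labels):
    """Determine a label from the generalized instance alone (assumption:
    a lookup of the bucket label of the cluster that is switched on)."""
    return labels[generalized.index(1)]


# Hypothetical labeled dataset: (features, label) pairs.
train = [
    ((1.0, 1.0), "low"),
    ((1.2, 0.8), "low"),
    ((5.0, 5.0), "high"),
    ((5.2, 4.9), "high"),
    ((9.0, 1.0), "low"),
]

centroids, labels = fit(train, radius=1.5)
second = (4.8, 5.3)                      # the "second instance"
g = generalize(second, centroids)        # -> [0, 0, 1]
print(g, predict(g, labels))             # prints: [0, 0, 1] high
```

In this sketch the label is predicted from the generalized instance only, so the raw feature values of the second instance are not needed once it has been mapped to a cluster. A trained predictive model over the generalized features could stand in for the lookup in `predict`; the abstract leaves that choice open.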
Public/Granted literature
- US20210042356A1: DATA GENERALIZATION FOR PREDICTIVE MODELS, Public/Granted day: 2021-02-11