Invention Grant
- Patent Title: System and method for classification of low relevance records in a database using instance-based classifiers and machine learning
-
Application No.: US15678657Application Date: 2017-08-16
-
Publication No.: US10585933B2Publication Date: 2020-03-10
- Inventor: Thiago Bianchi , Pablo Roberto Millicay Gonzalez , Giuliano Diniz de Morais
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Roberts Mlotkowski Safran Cole & Calderon, P.C.
- Agent William H. Hartwell; Andrew D. Wright
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F16/35 ; G06F16/23 ; G06F16/2455 ; G06F16/28

Abstract:
Devices and methods for classification of low relevance records in a database are disclosed. A method includes: in response to a request to delete a selected database record, generating a vector representation of the selected record, deleting the selected record in the database, and storing the vector representation of the deleted selected record; in response to the storing the vector representation of the deleted selected record, determining a cluster from which the vector representation has a shortest determined distance, among a plurality of clusters into which a plurality of vector representations of deleted records is partitioned; determining a distance between a record in the database and a nearest cluster among the plurality of clusters into which the plurality of vector representations of deleted records is partitioned; and in response to the record being within a predetermined distance of the nearest cluster, determining that the record is a deletion candidate record.
Public/Granted literature
Information query