Deep data classification using governance and machine learning
Abstract:
A method of data classification includes: identifying a cluster of data classes; classifying columns of a current data set; identifying the cluster in the current data set; determining, based on the cluster, an expected column is missing from the current data set; determining a neighboring data set; identifying the expected column in the neighboring data set; classifying the expected column in the neighboring data set; creating a new data class in the current data set; and classifying an unclassified column in the current data set or the neighboring data set with the new data class.
Public/Granted literature
Information query
Patent Agency Ranking
0/0