Invention Grant
- Patent Title: Methods and apparatus for transforming and statistically modeling relational databases to synthesize privacy-protected anonymized data
-
Application No.: US15836706Application Date: 2017-12-08
-
Publication No.: US10713384B2Publication Date: 2020-07-14
- Inventor: Kalyan Kumar Veeramachaneni , Neha Patki , Kishore Prabhakar Durg , Jeffrey Steven Wilkinson , Sunder Ranganathan Nochilur
- Applicant: Massachusetts Institute of Technology , Accenture Global Solutions Limited
- Applicant Address: US MA Cambridge IE Dublin
- Assignee: Massachusetts Institute of Technology,ACCENTURE GLOBAL SOLUTIONS LIMITED
- Current Assignee: Massachusetts Institute of Technology,ACCENTURE GLOBAL SOLUTIONS LIMITED
- Current Assignee Address: US MA Cambridge IE Dublin
- Agency: Smith Baluch LLP
- Main IPC: G06F21/62
- IPC: G06F21/62 ; G06N7/00 ; G06N20/00 ; G06F16/22 ; G06F16/28 ; G06F16/00

Abstract:
A relational database is transformed so as to obfuscate secure and/or private aspects of data contained in the database, while preserving salient elements of the data to facilitate data analysis. A restructured database is generatively modeled, and the model is sampled to create synthetic data that maintains sufficiently similar (or the same) mathematical properties and relations as the original data stored in the database. In one example, various statistics at the intersection of related database tables are determined by modeling data using an iterative multivariate approach. Synthetic data may be sampled from any part of the modeled database, wherein the synthesized data is “realistic” in that it statistically mimics the original data in the database. The generation of such synthetic data allows publication of bulk data freely and on-demand (e.g., for data analysis purposes), without the risk of security/privacy breaches.
Public/Granted literature
Information query