Invention Grant
- Patent Title: Machine learning modeling to identify sensitive data
-
Application No.: US18654684Application Date: 2024-05-03
-
Publication No.: US12293003B2Publication Date: 2025-05-06
- Inventor: Shubhanshu Gupta , Ashish Awasthi , Amaruvi Devanathan , Mallapu Raghavulu Surya Prakash
- Applicant: Citibank, N.A.
- Applicant Address: US NY New York
- Assignee: Citibank, N.A.
- Current Assignee: Citibank, N.A.
- Current Assignee Address: US NY New York
- Agency: Foley & Lardner LLP
- Main IPC: G06F21/62
- IPC: G06F21/62 ; G06F16/22 ; G06F16/334 ; G06F16/335

Abstract:
Methods and systems herein identify and redact personally identifiable information. A PII sensitivity detection framework includes multiple layers where each layer corresponds to a computer model. The framework analyzes data stored within different data tables and predicts whether a data column includes PII. The first layer corresponds to an artificial intelligence model that analyzes each column metadata and predicts a first score indicative of a likelihood of PII. The second layer corresponds to a rule-based computer model that uses various rules to determine a second score indicative of a likelihood of PII for each column. The third layer corresponds to a column content model that analyzes content of each column using various natural language processing techniques to generate a third score indicative of a likelihood of PII. The framework masks data being presented to a user based on the scores generated via execution of one or more of the layers.
Public/Granted literature
- US20240289492A1 MACHINE LEARNING MODELING TO IDENTIFY SENSITIVE DATA Public/Granted day:2024-08-29
Information query