Machine learning modeling to identify sensitive data

Invention Grant

US12293003B2 Machine learning modeling to identify sensitive data (in force)

Patent Title: Machine learning modeling to identify sensitive data
Application No.: US18654684

Application Date: 2024-05-03
Publication No.: US12293003B2

Publication Date: 2025-05-06
Inventor: Shubhanshu Gupta, Ashish Awasthi, Amaruvi Devanathan, Mallapu Raghavulu Surya Prakash
Applicant: Citibank, N.A.
Applicant Address: US NY New York
Assignee: Citibank, N.A.
Current Assignee: Citibank, N.A.
Current Assignee Address: US NY New York
Agency: Foley & Lardner LLP
Main IPC: G06F21/62
IPC: G06F21/62; G06F16/22; G06F16/334; G06F16/335

Abstract:

Methods and systems herein identify and redact personally identifiable information (PII). A PII sensitivity detection framework includes multiple layers where each layer corresponds to a computer model. The framework analyzes data stored within different data tables and predicts whether a data column includes PII. The first layer corresponds to an artificial intelligence model that analyzes each column's metadata and predicts a first score indicative of a likelihood of PII. The second layer corresponds to a rule-based computer model that uses various rules to determine a second score indicative of a likelihood of PII for each column. The third layer corresponds to a column content model that analyzes content of each column using various natural language processing techniques to generate a third score indicative of a likelihood of PII. The framework masks data being presented to a user based on the scores generated via execution of one or more of the layers.
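
The layered scoring described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the `Column` type, the keyword list, the regular expressions, the use of `max` to combine the three scores, and the 0.5 masking threshold are all hypothetical choices made for illustration. A simple keyword check stands in for the metadata model and a token pattern check stands in for the natural language processing layer.

```python
import re
from dataclasses import dataclass
from typing import List

# Everything below is an illustrative assumption, not the patent's method.

@dataclass
class Column:
    name: str
    values: List[str]

PII_NAME_HINTS = ("ssn", "email", "phone", "dob", "name", "address")
SSN_RE = re.compile(r"^\d{3}-\d{2}-\d{4}$")
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")

def metadata_score(col: Column) -> float:
    """Layer 1 stand-in: keyword check on the column name instead of a trained model."""
    lowered = col.name.lower()
    return 0.9 if any(hint in lowered for hint in PII_NAME_HINTS) else 0.1

def rule_score(col: Column) -> float:
    """Layer 2 stand-in: fraction of values matching one fixed-format rule (SSN-like)."""
    if not col.values:
        return 0.0
    return sum(1 for v in col.values if SSN_RE.match(v)) / len(col.values)

def content_score(col: Column) -> float:
    """Layer 3 stand-in: fraction of values containing an email-like token."""
    if not col.values:
        return 0.0
    hits = sum(
        1 for v in col.values
        if any(EMAIL_RE.match(token) for token in v.split())
    )
    return hits / len(col.values)

def pii_score(col: Column) -> float:
    """Combine layer scores; taking the maximum is an assumed policy."""
    return max(metadata_score(col), rule_score(col), content_score(col))

def mask_column(col: Column, threshold: float = 0.5) -> List[str]:
    """Return masked values when the combined score reaches the assumed threshold."""
    if pii_score(col) >= threshold:
        return ["***" for _ in col.values]
    return list(col.values)

if __name__ == "__main__":
    emails = Column("customer_email", ["a@b.com", "c@d.org"])
    totals = Column("order_total", ["10.50", "3.99"])
    print(mask_column(emails))
    print(mask_column(totals))
```

Taking the maximum means any single layer can trigger masking, which favors over-masking to under-masking; a weighted combination or a staged cascade (running later layers only when earlier ones are inconclusive) would be equally consistent with the abstract's "one or more of the layers" wording.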

Public/Granted literature

US20240289492A1 MACHINE LEARNING MODELING TO IDENTIFY SENSITIVE DATA Public/Granted day: 2024-08-29

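IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F21/00	Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
G06F21/60	. Protecting data
G06F21/62	.. Protecting access to data via a platform, e.g. using keys or access control rules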