Invention Grant
- Patent Title: Machine-learning based detection and classification of personally identifiable information
- Application No.: US16125389
- Application Date: 2018-09-07
- Publication No.: US10585989B1
- Publication Date: 2020-03-10
- Inventor: Mohamed N. Ahmed, Andeep S. Toor
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Garg Law Firm, PLLC
- Agent: Rakesh Garg; James Nock
- Main IPC: G06F17/27
- IPC: G06F17/27; G06N3/04; G06N3/08

Abstract:
Detection and classification of personally identifiable information includes identifying a document with a known author. A first set of features of the document is extracted using natural language processing, and a second set of features of the document is extracted based upon one or more past documents for the known author using a recurrent neural network. The first set of features and the second set of features are classified using a classifier to produce classified extracted features. Personally identifiable information is labeled in the document based upon the classified extracted features.
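The pipeline in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the regular-expression cues stand in for a real natural language processing step, the recurrent cell is a tiny untrained Elman-style network with random weights, and the classifier is a hand-written rule. All names (`nlp_features`, `TinyRNN`, `classify`, `label_pii`) and the label set are hypothetical.

```python
import math
import random
import re


def nlp_features(token):
    """First feature set: surface cues standing in for a real NLP pipeline."""
    return [
        1.0 if re.fullmatch(r"\d{3}-\d{2}-\d{4}", token) else 0.0,     # SSN-like shape
        1.0 if re.fullmatch(r"[^@\s]+@[^@\s]+\.\w+", token) else 0.0,  # email-like shape
        1.0 if token[:1].isupper() else 0.0,                           # capitalized
        sum(ch.isdigit() for ch in token) / max(len(token), 1),        # digit ratio
    ]


class TinyRNN:
    """Untrained Elman-style cell: h_t = tanh(W x_t + U h_{t-1}).

    Assumption: weights are random; a real system would learn them.
    """

    def __init__(self, input_size, hidden_size, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        self.W = [[rng.uniform(-0.5, 0.5) for _ in range(input_size)]
                  for _ in range(hidden_size)]
        self.U = [[rng.uniform(-0.5, 0.5) for _ in range(hidden_size)]
                  for _ in range(hidden_size)]

    def encode(self, sequence):
        """Fold a sequence of input vectors into one hidden-state vector."""
        h = [0.0] * self.hidden_size
        for x in sequence:
            h = [
                math.tanh(
                    sum(w * xi for w, xi in zip(self.W[j], x))
                    + sum(u * hi for u, hi in zip(self.U[j], h))
                )
                for j in range(self.hidden_size)
            ]
        return h


def classify(features):
    """Hypothetical rule-based classifier over the combined feature vector.

    It inspects only the two pattern cues; a trained classifier would also
    weigh the author-history features appended after them.
    """
    if features[0] == 1.0:
        return "SSN"
    if features[1] == 1.0:
        return "EMAIL"
    return "O"  # not personally identifiable


def label_pii(document, past_documents, rnn):
    """Label each whitespace-separated token of a document by a known author."""
    # Second feature set: summary of the author's past documents via the RNN.
    history = [nlp_features(t) for doc in past_documents for t in doc.split()]
    author_context = rnn.encode(history)
    labeled = []
    for token in document.split():
        combined = nlp_features(token) + author_context
        labeled.append((token, classify(combined)))
    return labeled


rnn = TinyRNN(input_size=4, hidden_size=3)
result = label_pii(
    "Contact alice@example.com or 123-45-6789 today",
    ["Hello from Alice", "My number is 555-0100"],
    rnn,
)
print(result)
```

The two feature sets are concatenated per token before classification, which mirrors the order of steps in the abstract. Because the recurrent weights here are random, the author-history vector carries no learned signal; it only shows where such features would enter the classifier.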
Public/Granted literature
- US20200081978A1 MACHINE-LEARNING BASED DETECTION AND CLASSIFICATION OF PERSONALLY IDENTIFIABLE INFORMATION, Public/Granted day: 2020-03-12