Invention Grant
- Patent Title: Systems and techniques to monitor text data quality
- Application No.: US17962719
- Application Date: 2022-10-10
- Publication No.: US11748448B2
- Publication Date: 2023-09-05
- Inventor: Robin Astrid Epp Neufeld
- Applicant: Capital One Services, LLC
- Applicant Address: US VA McLean
- Assignee: Capital One Services, LLC
- Current Assignee: Capital One Services, LLC
- Current Assignee Address: US VA McLean
- Agency: KDW Firm PLLC
- Original Application No. (divisional parent): US16406848 (2019-05-08)
- Main IPC: G06F11/30
- IPC: G06F11/30 ; G06F18/214 ; G06N20/00 ; G06N3/08 ; G06V30/148 ; G06F18/2411

Abstract:
Disclosed are a system, an apparatus, and techniques for evaluating a dataset to confirm that the data in the dataset satisfies a data quality metric. A machine learning engine or the like may evaluate text strings within the dataset, which may be of arbitrary length and encoded according to an encoding standard. Data vectors of a preset length may be generated from the evaluated text strings using various techniques. Each data vector may be representative of the content of its text string, and a category may be assigned to the respective data vector. The category assigned to each data vector may be evaluated with respect to other data vectors in the dataset to determine compliance with a quality metric. If a number of data vectors fail to meet a predetermined quality metric, an alert may be generated to mitigate any system errors that may result from unsatisfactory data quality.
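
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract mentions a machine learning engine, whereas this sketch substitutes a hand-written character-class histogram as the fixed-length vector and an argmax rule as the category assignment. The names `text_to_vector`, `assign_category`, `check_quality`, the four category labels, and the 0.9 threshold are all hypothetical choices made for illustration.

```python
from collections import Counter

# Hypothetical category labels; the abstract does not name any.
CATEGORIES = ("alpha", "numeric", "space", "other")


def text_to_vector(raw: bytes, encoding: str = "utf-8") -> list[float]:
    """Map an encoded string of any length to a 4-element vector of
    character-class fractions (a stand-in for a learned embedding)."""
    # Undecodable bytes become U+FFFD, which is counted as "other" below.
    text = raw.decode(encoding, errors="replace")
    counts = [0, 0, 0, 0]
    for ch in text:
        if ch.isalpha():
            counts[0] += 1
        elif ch.isdigit():
            counts[1] += 1
        elif ch.isspace():
            counts[2] += 1
        else:
            counts[3] += 1
    total = sum(counts)
    # An empty string has no content; treat it as "other" (assumption).
    return [c / total for c in counts] if total else [0.0, 0.0, 0.0, 1.0]


def assign_category(vector: list[float]) -> str:
    """Assign the category whose character class dominates the vector."""
    return CATEGORIES[max(range(len(vector)), key=vector.__getitem__)]


def check_quality(raw_values, expected=None, min_fraction=0.9, encoding="utf-8"):
    """Return (fraction of values in the target category, alert flag).

    If `expected` is None, the target is the most common category among
    the values themselves, so each value is judged against its peers.
    """
    categories = [assign_category(text_to_vector(v, encoding)) for v in raw_values]
    if not categories:
        return 1.0, False
    counts = Counter(categories)
    target = expected if expected is not None else counts.most_common(1)[0][0]
    fraction = counts[target] / len(categories)
    return fraction, fraction < min_fraction


# Usage: a column that should hold numbers but contains one stray word.
column = [b"12345", b"9876", b"hello", b"42"]
fraction, alert = check_quality(column)
if alert:
    print(f"data quality alert: only {fraction:.0%} of values match the dominant category")
```

In a real deployment the histogram would presumably be replaced by whatever vectorization and classifier the monitoring system trains; the point of the sketch is only the shape of the pipeline: arbitrary-length input, fixed-length vector, category, dataset-level comparison, alert.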
Public/Granted literature
- US20230133247A1, SYSTEMS AND TECHNIQUES TO MONITOR TEXT DATA QUALITY, Public/Granted day: 2023-05-04