Invention Grant
- Patent Title: Systems and methods for quickly searching datasets by indexing synthetic data generating models
- Application No.: US16409745
- Application Date: 2019-05-10
- Publication No.: US11113124B2
- Publication Date: 2021-09-07
- Inventor: Austin Walters, Jeremy Goodsitt, Galen Rafferty, Vincent Pham, Anh Truong, Kate Key, Reza Farivar, Mark Watson
- Applicant: CAPITAL ONE SERVICES, LLC
- Applicant Address: US VA McLean
- Assignee: CAPITAL ONE SERVICES, LLC
- Current Assignee: CAPITAL ONE SERVICES, LLC
- Current Assignee Address: US VA McLean
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
- Main IPC: G06N20/00
- IPC: G06N20/00 ; G06F9/54 ; G06F17/16 ; G06N3/04 ; G06F11/36 ; G06N3/08 ; G06F21/62 ; G06N5/04 ; G06F17/15 ; G06T7/194 ; G06T7/254 ; G06T7/246 ; G06F16/2455 ; G06F16/22 ; G06F16/28 ; G06F16/906 ; G06F16/93 ; G06F16/903 ; G06F16/9038 ; G06F16/9032 ; G06F16/25 ; G06F16/335 ; G06F16/242 ; G06F16/248 ; G06F30/20 ; G06F40/166 ; G06F40/117 ; G06F40/20 ; G06F8/71 ; G06F17/18 ; G06F21/55 ; G06F21/60 ; G06K9/03 ; G06K9/62 ; G06K9/66 ; G06K9/68 ; G06K9/72 ; G06N7/00 ; G06Q10/04 ; G06T11/00 ; H04L29/06 ; H04L29/08 ; H04N21/234 ; H04N21/81 ; G06N5/00 ; G06N5/02

Abstract:
Systems and methods for searching datasets and classifying datasets are disclosed. For example, a system may include one or more memory units storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may include receiving a test dataset from a client device and generating a test data model output using a data model, based on the test dataset. The operations may include processing the test data model output by implementing an encoding method, a factorizing method, and/or a vectorizing method. The operations may include retrieving a reference data model output from a dataset index, based on a reference dataset. The operations may include generating a similarity metric based on the reference data model output and the test data model output. The operations may include classifying the test dataset based on the similarity metric and transmitting, to the client device, information comprising the classification.
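
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything named here is an assumption made for illustration: `toy_data_model` (column means standing in for the data model and the vectorizing step), the in-memory dictionary used as the dataset index, cosine similarity as the similarity metric, and the `threshold` value used for classification.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Vector = List[float]


def toy_data_model(dataset: Sequence[Sequence[float]]) -> Vector:
    """Hypothetical stand-in for the data model: returns per-column means
    as an already vectorized model output."""
    if not dataset:
        raise ValueError("dataset must contain at least one row")
    n_rows = len(dataset)
    return [sum(column) / n_rows for column in zip(*dataset)]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Assumed similarity metric; the abstract does not name a specific one."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify_dataset(
    test_dataset: Sequence[Sequence[float]],
    dataset_index: Dict[str, Vector],
    model: Callable[[Sequence[Sequence[float]]], Vector] = toy_data_model,
    threshold: float = 0.9,  # assumed cutoff, not taken from the patent
) -> Tuple[str, float]:
    """Compare the test data model output against every reference output
    stored in the index and return (classification, best similarity)."""
    test_output = model(test_dataset)
    best_label, best_score = "unclassified", -1.0
    for label, reference_output in dataset_index.items():
        score = cosine_similarity(test_output, reference_output)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < threshold:
        return "unclassified", best_score
    return best_label, best_score


# Hypothetical index of reference data model outputs keyed by dataset class.
dataset_index = {"transactions": [10.0, 1.0], "sensor": [0.0, 5.0]}
label, score = classify_dataset([[9.0, 1.0], [11.0, 1.0]], dataset_index)
print(label, round(score, 3))
```

In this sketch the index stores precomputed reference outputs, so classifying a new dataset needs only one model evaluation plus one comparison per index entry. A real system would likely replace the linear scan with a dedicated vector index.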
Public/Granted literature
- US20200012662A1 SYSTEMS AND METHODS FOR QUICKLY SEARCHING DATASETS BY INDEXING SYNTHETIC DATA GENERATING MODELS Public/Granted day: 2020-01-09