Systems and methods for quickly searching datasets by indexing synthetic data generating models
Abstract:
Systems and methods for searching datasets and classifying datasets are disclosed. For example, a system may include one or more memory units storing instructions and one or more processors configured to execute the instructions to perform operations. The operations may include receiving a test dataset from a client device and generating a test data model output using a data model, based on the test dataset. The operations may include processing the test data model output by implementing an encoding method, a factorizing method, and/or a vectorizing method. The operations may include retrieving, from a dataset index, a reference data model output that is based on a reference dataset. The operations may include generating a similarity metric based on the reference data model output and the test data model output. The operations may include classifying the test dataset based on the similarity metric and transmitting, to the client device, information comprising the classification.
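The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the abstract does not specify the data model, the vectorizing method, or the similarity metric, so the sketch assumes per-column summary statistics as a stand-in for the data model output, a flat list as the vectorized form, and cosine similarity as the metric. The names `model_output`, `cosine_similarity`, `classify`, and the `threshold` parameter are all hypothetical.

```python
import math
from statistics import mean, pstdev


def model_output(dataset):
    """Hypothetical stand-in for a data model output.

    Summarizes each column of a row-oriented numeric dataset by its mean and
    population standard deviation, then flattens the result into one vector.
    """
    vector = []
    for column in zip(*dataset):
        vector.extend([mean(column), pstdev(column)])
    return vector


def cosine_similarity(a, b):
    """Assumed similarity metric: cosine of the angle between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(test_dataset, index, threshold=0.9):
    """Return (label, score) for the most similar reference in the index.

    `index` maps a label to a precomputed reference model output. If no
    reference reaches `threshold`, the label is "unknown".
    """
    test_vector = model_output(test_dataset)
    best_label, best_score = "unknown", -1.0
    for label, reference_vector in index.items():
        score = cosine_similarity(test_vector, reference_vector)
        if score > best_score:
            best_label, best_score = label, score
    if best_score < threshold:
        return "unknown", best_score
    return best_label, best_score


# Build a small dataset index from two reference datasets (illustrative data).
index = {
    "ages": model_output([[30, 1], [40, 2], [50, 3]]),
    "prices": model_output([[1, 300], [2, 400], [3, 500]]),
}

# Classify a test dataset as it might arrive from a client device.
label, score = classify([[32, 1], [41, 2], [49, 3]], index)
```

Storing the precomputed reference outputs in the index, rather than the reference datasets themselves, is what keeps the search fast in this sketch: each lookup compares short vectors instead of re-processing full datasets.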