Invention Grant
- Patent Title: Dataset quality for synthetic data generation in computer-based reasoning systems
-
Application No.: US17333671Application Date: 2021-05-28
-
Publication No.: US11640561B2Publication Date: 2023-05-02
- Inventor: Christopher James Hazard , Jacob David Beel , Yash Shah , Ravisutha Sakrepatna Srinivasamurthy , Michael Resnick
- Applicant: Diveplane Corporation
- Applicant Address: US NC Raleigh
- Assignee: Diveplane Corporation
- Current Assignee: Diveplane Corporation
- Current Assignee Address: US NC Raleigh
- Agency: Dority & Manning, P.A.
- Main IPC: G06F15/16
- IPC: G06F15/16 ; G06K9/62 ; G06N5/045 ; G06N20/00 ; G06F21/62

Abstract:
Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or those that are not subject to conditions), a value for the feature is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, it may be modified (e.g., resampled), removed, and/or replaced.
Public/Granted literature
- US20230140834A9 Dataset Quality for Synthetic Data Generation in Computer-Based Reasoning Systems Public/Granted day:2023-05-04
Information query