Invention Grant
- Patent Title: Systems and methods for subset selection and optimization for balanced sampled dataset generation
- Application No.: US16730111
- Application Date: 2019-12-30
- Publication No.: US11675926B2
- Publication Date: 2023-06-13
- Inventor: Christopher Muffat, Tetiana Kodliuk
- Applicant: Dathena Science Pte Ltd
- Applicant Address: SG Singapore
- Assignee: DATHENA SCIENCE PTE LTD
- Current Assignee: DATHENA SCIENCE PTE LTD
- Current Assignee Address: SG Singapore
- Agency: FisherBroyles, LLP
- Agent: Jason P. Mueller
- Priority: SG 201811834U 2018-12-31
- Main IPC: G06F16/93
- IPC: G06F16/93; G06F21/62; G06F16/9035; G06N20/00; G06F16/906; G06F18/23213

Abstract:
Methods and systems for data management of documents in one or more data repositories in a computer network or cloud infrastructure are provided. The method includes sampling the documents in the one or more data repositories and formulating representative subsets of the sampled documents. The method further includes generating sampled data sets of the sampled documents and balancing the sampled data sets for further processing of the sampled documents. The formulation of the representative subsets is performed for identification of some of the representative subsets for initial processing.
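
The abstract does not state how the sampling or balancing is carried out. Purely as an illustration of what "balancing the sampled data sets" could mean, the sketch below groups documents by a key (here, a hypothetical repository name) and downsamples every group to the same size. The function name `balanced_sample`, the `group_of` callback, and the equal-size downsampling rule are all assumptions made for this example and are not taken from the patent.

```python
import random
from collections import defaultdict


def balanced_sample(documents, group_of, per_group, seed=0):
    """Return a dict mapping each group key to an equally sized random sample.

    Assumption (not from the patent): "balanced" is read here as
    "every group contributes the same number of documents".
    """
    rng = random.Random(seed)

    # Bucket the documents by the caller-supplied grouping key.
    groups = defaultdict(list)
    for doc in documents:
        groups[group_of(doc)].append(doc)

    if not groups:
        return {}

    # Use the smallest group size, capped at per_group, so all samples match.
    size = min(per_group, min(len(members) for members in groups.values()))

    # Sample without replacement inside each group.
    return {key: rng.sample(members, size) for key, members in groups.items()}


# Hypothetical input: (path, repository) pairs.
docs = (
    [(f"share/doc{i}.txt", "share") for i in range(10)]
    + [(f"mail/doc{i}.txt", "mail") for i in range(3)]
    + [(f"cloud/doc{i}.txt", "cloud") for i in range(5)]
)
sample = balanced_sample(docs, group_of=lambda d: d[1], per_group=4)
```

With this input the smallest group holds three documents, so each repository contributes three documents to the sample. A real system might instead balance by document type, language, or cluster assignment; the grouping key is deliberately left to the caller.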
Public/Granted literature
- US20200250241A1 SYSTEMS AND METHODS FOR SUBSET SELECTION AND OPTIMIZATION FOR BALANCED SAMPLED DATASET GENERATION, Public/Granted day: 2020-08-06