Generating quantitatively assessed synthetic training data

Invention Grant

US11636390B2 Generating quantitatively assessed synthetic training data 有权

Please log in to see more content

Patent Title: Generating quantitatively assessed synthetic training data
Application No.: US16823772

Application Date: 2020-03-19
Publication No.: US11636390B2

Publication Date: 2023-04-25
Inventor: Gabriele Ranco , Moises Noe Sanchez Garcia , Gordon Doyle
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent Randy E. Tejeda
Main IPC: G06N20/00
IPC: G06N20/00 ; G06K9/62 ; G06F17/16 ; G06F17/18

Abstract:

In an approach to generating quantitatively assessed synthetic training data, one or more computer processors identify an initial plurality of clusters in a dataset utilizing a trained classification model and a plurality of associated hyperparameters, wherein the clusters have sufficient density to be represented in a calculated probability distribution. The one or more computer processors generate one or more synthetic data points for each identified cluster utilizing a corresponding calculated probability distribution. The one or more computer processors quantitatively assess the one or more generated synthetic data points.

Public/Granted literature

US20210295205A1 GENERATING QUANTITATIVELY ASSESSED SYNTHETIC TRAINING DATA Public/Granted day:2021-09-23

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习