Determining model-related bias associated with training data

Invention Grant

US11302096B2 Determining model-related bias associated with training data 有权

Please log in to see more content

Patent Title: Determining model-related bias associated with training data
Application No.: US16690738

Application Date: 2019-11-21
Publication No.: US11302096B2

Publication Date: 2022-04-12
Inventor: Pranay Kumar Lohia , Diptikalyan Saha , Manish Anand Bhide , Sameep Mehta
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Ryan, Mason & Lewis, LLP
Main IPC: G06N20/00
IPC: G06N20/00 ; G06V10/75 ; G06K9/62 ; G06N5/04

Abstract:

Methods, systems, and computer program products for determining model-related bias associated with training data are provided herein. A computer-implemented method includes obtaining, via execution of a first model, class designations attributed to data points used to train the first model; identifying any of the data points associated with an inaccurate class designation and/or a low-confidence class designation; training a second model using the data points from the dataset, but excluding the identified data points; determining bias related to at least a portion of those data points used to train the second model by: modifying one or more of the data points used to train the second model; executing the first model using the modified data points; and identifying a change to one or more class designations attributed to the modified data points as compared to before the modifying; and outputting identifying information pertaining to the determined bias.

Public/Granted literature

US20210158076A1 Determining Model-Related Bias Associated with Training Data Public/Granted day:2021-05-27

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习