Using gradients to detect backdoors in neural networks

Invention Grant

US11132444B2 Using gradients to detect backdoors in neural networks 有权

Please log in to see more content

Patent Title: Using gradients to detect backdoors in neural networks
Application No.: US15953956

Application Date: 2018-04-16
Publication No.: US11132444B2

Publication Date: 2021-09-28
Inventor: Wilka Carvalho , Bryant Chen , Benjamin J. Edwards , Taesung Lee , Ian M. Molloy , Jialong Zhang
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agent Stephen J. Walder, Jr.; Jeffrey S. LaBaw
Main IPC: G06F21/57
IPC: G06F21/57 ; G06N3/08 ; G06N20/00

Using gradients to detect backdoors in neural networks

Abstract:

Mechanisms are provided for evaluating a trained machine learning model to determine whether the machine learning model has a backdoor trigger. The mechanisms process a test dataset to generate output classifications for the test dataset, and generate, for the test dataset, gradient data indicating a degree of change of elements within the test dataset based on the output generated by processing the test dataset. The mechanisms analyze the gradient data to identify a pattern of elements within the test dataset indicative of a backdoor trigger. The mechanisms generate, in response to the analysis identifying the pattern of elements indicative of a backdoor trigger, an output indicating the existence of the backdoor trigger in the trained machine learning model.

Public/Granted literature

US20190318099A1 Using Gradients to Detect Backdoors in Neural Networks Public/Granted day:2019-10-17

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F21/00	防止未授权行为的保护计算机、其部件、程序或数据的安全装置
G06F21/50	.监控用户、程序或设备，以维护平台完整。例如：处理器、固件或操作系统
G06F21/57	..确保或维持可信任的计算机平台，例如安全引导或断电、版本控制、系统软件检查、安全更新或评估漏洞