Using multimodal model consistency to detect adversarial attacks
Abstract:
A method, apparatus and computer program product to defend learning models that are vulnerable to adversarial example attacks. It is assumed that data (a “dataset”) is available in multiple modalities (e.g., text and images, audio and images in video, etc.). The defense approach herein is premised on the recognition that the correlations between the different modalities for the same entity can be exploited to defend against such attacks, as it is not realistic for an adversary to attack multiple modalities. To this end, according to this technique, adversarial samples are identified and rejected if the features from one (the attacked) modality are determined to be sufficiently far away from those of another, un-attacked modality for the same entity. In other words, the approach herein leverages the consistency between multiple modalities in the data to defend against adversarial attacks on one modality.
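The consistency check described in the abstract can be illustrated with a minimal sketch. Assuming feature embeddings have already been extracted for each modality and projected into a shared space by aligned encoders (the cosine-distance metric, the 512-dimensional embeddings, and the rejection threshold below are illustrative assumptions, not details given by the abstract), the defense reduces to comparing the embeddings from the two modalities for the same entity and rejecting the sample when they disagree too strongly:

```python
import numpy as np

def cosine_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Return 1 - cosine similarity between two feature vectors."""
    a = a / (np.linalg.norm(a) + 1e-12)
    b = b / (np.linalg.norm(b) + 1e-12)
    return 1.0 - float(np.dot(a, b))

def is_adversarial(attacked_features: np.ndarray,
                   reference_features: np.ndarray,
                   threshold: float = 0.5) -> bool:
    """Flag a sample as adversarial if the features from the possibly
    attacked modality are too far from those of the un-attacked modality
    for the same entity. The threshold is a hypothetical tuning parameter."""
    return cosine_distance(attacked_features, reference_features) > threshold

# Hypothetical usage: stand-ins for an image embedding (attacked modality)
# and a text embedding (un-attacked modality) of the same entity.
img_emb = np.random.rand(512)
txt_emb = np.random.rand(512)
if is_adversarial(img_emb, txt_emb):
    print("Sample rejected: modalities are inconsistent")
else:
    print("Sample accepted: modalities are consistent")
```

In a real deployment the threshold would be calibrated on clean data so that consistent multimodal pairs fall below it; this sketch only shows the rejection logic the abstract describes.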