Defending machine learning systems from adversarial attacks

Invention Grant

US11893111B2 Defending machine learning systems from adversarial attacks 有权

Please log in to see more content

Patent Title: Defending machine learning systems from adversarial attacks
Application No.: US16696144

Application Date: 2019-11-26
Publication No.: US11893111B2

Publication Date: 2024-02-06
Inventor: Srinivas Kruthiveti Subrahmanyeswara Sai , Aashish Kumar , Alexander Kreines , George Jose , Sambuddha Saha , Nir Morgulis , Shachar Mendelowitz
Applicant: HARMAN INTERNATIONAL INDUSTRIES, INCORPORATED
Applicant Address: US CT Stamford
Assignee: Harman International Industries, Incorporated
Current Assignee: Harman International Industries, Incorporated
Current Assignee Address: US CT Stamford
Agency: Artegis Law Group, LLP
Main IPC: G06F21/55
IPC: G06F21/55 ; G06N20/00 ; G06N3/04

Defending machine learning systems from adversarial attacks

Abstract:

Techniques are disclosed for detecting adversarial attacks. A machine learning (ML) system processes the input into and output of a ML model using an adversarial detection module that does not include a direct external interface. The adversarial detection module includes a detection model that generates a score indicative of whether the input is adversarial using, e.g., a neural fingerprinting technique or a comparison of features extracted by a surrogate ML model to an expected feature distribution for the output of the ML model. In turn, the adversarial score is compared to a predefined threshold for raising an adversarial flag. Appropriate remedial measures, such as notifying a user, may be taken when the adversarial score satisfies the threshold and raises the adversarial flag.

Public/Granted literature

US20210157912A1 DEFENDING MACHINE LEARNING SYSTEMS FROM ADVERSARIAL ATTACKS Public/Granted day:2021-05-27

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F21/00	防止未授权行为的保护计算机、其部件、程序或数据的安全装置
G06F21/50	.监控用户、程序或设备，以维护平台完整。例如：处理器、固件或操作系统
G06F21/55	..检测本地入侵或实施对策