Invention Grant
- Patent Title: Defending machine learning systems from adversarial attacks
-
Application No.: US16696144Application Date: 2019-11-26
-
Publication No.: US11893111B2Publication Date: 2024-02-06
- Inventor: Srinivas Kruthiveti Subrahmanyeswara Sai , Aashish Kumar , Alexander Kreines , George Jose , Sambuddha Saha , Nir Morgulis , Shachar Mendelowitz
- Applicant: HARMAN INTERNATIONAL INDUSTRIES, INCORPORATED
- Applicant Address: US CT Stamford
- Assignee: Harman International Industries, Incorporated
- Current Assignee: Harman International Industries, Incorporated
- Current Assignee Address: US CT Stamford
- Agency: Artegis Law Group, LLP
- Main IPC: G06F21/55
- IPC: G06F21/55 ; G06N20/00 ; G06N3/04

Abstract:
Techniques are disclosed for detecting adversarial attacks. A machine learning (ML) system processes the input into and output of a ML model using an adversarial detection module that does not include a direct external interface. The adversarial detection module includes a detection model that generates a score indicative of whether the input is adversarial using, e.g., a neural fingerprinting technique or a comparison of features extracted by a surrogate ML model to an expected feature distribution for the output of the ML model. In turn, the adversarial score is compared to a predefined threshold for raising an adversarial flag. Appropriate remedial measures, such as notifying a user, may be taken when the adversarial score satisfies the threshold and raises the adversarial flag.
Public/Granted literature
- US20210157912A1 DEFENDING MACHINE LEARNING SYSTEMS FROM ADVERSARIAL ATTACKS Public/Granted day:2021-05-27
Information query