- Patent Title: Open source vulnerability prediction with machine learning ensemble
-
Application No.: US16105016Application Date: 2018-08-20
-
Publication No.: US11416622B2Publication Date: 2022-08-16
- Inventor: Asankhaya Sharma , Yaqin Zhou
- Applicant: Veracode, Inc.
- Applicant Address: US MA Burlington
- Assignee: Veracode, Inc.
- Current Assignee: Veracode, Inc.
- Current Assignee Address: US MA Burlington
- Agency: Gilliam IP PLLC
- Main IPC: G06F21/57
- IPC: G06F21/57 ; G06N7/00 ; G06N99/00 ; G06N20/00

Abstract:
A system to create a stacked classifier model combination or classifier ensemble has been designed for identification of undisclosed flaws in software components on a large-scale. This classifier ensemble is capable of at least a 54.55% improvement in precision. The system uses a K-folding cross validation algorithm to partition a sample dataset and then train and test a set of N classifiers with the dataset folds. At each test iteration, trained models of the set of classifiers generate probabilities that a sample has a flaw, resulting in a set of N probabilities or predictions for each sample in the test data. With a sample size of S, the system passes the S sets of N predictions to a logistic regressor along with “ground truth” for the sample dataset to train a logistic regression model. The trained classifiers and the logistic regression model are stored as the classifier ensemble.
Public/Granted literature
- US20200057858A1 OPEN SOURCE VULNERABILITY PREDICTION WITH MACHINE LEARNING ENSEMBLE Public/Granted day:2020-02-20
Information query