Invention Grant
- Patent Title: System and method for detecting data drift
- Application No.: US16913256
- Application Date: 2020-06-26
- Publication No.: US11544634B2
- Publication Date: 2023-01-03
- Inventor: Vathy M. Kamulete
- Applicant: ROYAL BANK OF CANADA
- Applicant Address: CA Toronto
- Assignee: ROYAL BANK OF CANADA
- Current Assignee: ROYAL BANK OF CANADA
- Current Assignee Address: CA Toronto
- Agency: Norton Rose Fulbright Canada LLP
- Main IPC: G06N20/20
- IPC: G06N20/20 ; G06N5/00 ; G06N7/00

Abstract:
Data drift, or dataset shift, between a training dataset and a test dataset is detected by: training a scoring function using a pooled dataset, the pooled dataset including a union of the training dataset and the test dataset; obtaining an outlier score for each instance in the training dataset and the test dataset based at least in part on the scoring function; assigning a weight to each outlier score based at least in part on training contamination rates; determining a test statistic based at least in part on the outlier scores and the weights; determining a null distribution of no dataset shift for the test statistic; determining a threshold in the null distribution; and, when the test statistic is greater than or equal to the threshold, identifying dataset shift between the training dataset and the test dataset.
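
The steps recited in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not specify the scoring function, the weighting scheme, or how the null distribution is obtained, so the choices below are assumptions made only for illustration: a standardized distance from the pooled mean as the outlier score, a squared weight derived from the fraction of training scores at or above each test score (used here as the "training contamination rate"), a permutation-based null distribution, and a (1 - alpha) quantile as the threshold. The function names `outlier_scores`, `weighted_statistic`, and `detect_shift` are hypothetical.

```python
import numpy as np


def outlier_scores(pooled):
    """Assumed scoring function: standardized distance from the pooled mean."""
    mu = pooled.mean(axis=0)
    sigma = pooled.std(axis=0) + 1e-12  # avoid division by zero
    return np.linalg.norm((pooled - mu) / sigma, axis=1)


def weighted_statistic(scores, is_test):
    """Assumed test statistic: mean weight of the test-instance scores."""
    train_sorted = np.sort(scores[~is_test])
    # Number of training scores strictly below each test score.
    n_below = np.searchsorted(train_sorted, scores[is_test], side="left")
    # Assumed contamination rate: fraction of training scores >= the score.
    contamination = 1.0 - n_below / train_sorted.size
    # Assumed weighting: scores that few training instances reach weigh more.
    weights = (1.0 - contamination) ** 2
    return weights.mean()


def detect_shift(train, test, alpha=0.05, n_perm=1000, seed=0):
    """Return (shift_detected, test_statistic, threshold)."""
    rng = np.random.default_rng(seed)
    pooled = np.vstack([train, test])        # union of train and test
    scores = outlier_scores(pooled)          # one score per pooled instance
    is_test = np.zeros(len(pooled), dtype=bool)
    is_test[len(train):] = True

    observed = weighted_statistic(scores, is_test)

    # Null distribution of no shift: shuffle the train/test labels.
    null = np.array([
        weighted_statistic(scores, rng.permutation(is_test))
        for _ in range(n_perm)
    ])
    threshold = np.quantile(null, 1.0 - alpha)
    return observed >= threshold, observed, threshold


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    train = rng.normal(size=(200, 3))
    test = rng.normal(scale=3.0, size=(200, 3))  # test data is more dispersed
    detected, stat, thr = detect_shift(train, test)
    print(f"statistic={stat:.3f} threshold={thr:.3f} shift={bool(detected)}")
```

In this sketch the scores are computed once and reused across permutations, because a scoring function fitted on the pooled data does not depend on which instances are labeled as training or test. Note that this particular distance-based score responds to test instances lying farther from the pooled center than training instances; a different scoring function would be needed to pick up other kinds of shift.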