-
公开(公告)号:US10552739B1
公开(公告)日:2020-02-04
申请号:US16503742
申请日:2019-07-05
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Roger Jay Barney , John P. Trawinski
Abstract: An apparatus includes a processor to: provide a set of feature routines to a set of processor cores to detect features of a data set distributed thereamong; generate metadata indicative of the detected features; generate context data indicative of contextual aspects of the data set; provide the metadata and context data to each processor core, and distribute a set of suggestion models thereamong to enable derivation of a suggested subset of data preparation operations to be suggested to be performed on the data set; transmit indications of the suggested subset to a viewing device, and receive therefrom indications of a selected subset of data preparation operations selected to be performed; compare the selected and suggested subsets; and in response to differences therebetween, re-train at least one suggestion model of the set of suggestion models based at least on the combination of the metadata, context data and selected subset.
-
公开(公告)号:US10909460B2
公开(公告)日:2021-02-02
申请号:US16726339
申请日:2019-12-24
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Roger Jay Barney , John P. Trawinski
Abstract: An apparatus includes a processor to: provide a set of feature routines to a set of processor cores to detect features of a data set distributed thereamong; generate metadata indicative of the detected features; generate context data indicative of contextual aspects of the data set; provide the metadata and context data to each processor core, and distribute a set of suggestion models thereamong to enable derivation of a suggested subset of data preparation operations to be suggested to be performed on the data set; transmit indications of the suggested subset to a viewing device, and receive therefrom indications of a selected subset of data preparation operations selected to be performed; compare the selected and suggested subsets; and in response to differences therebetween, re-train at least one suggestion model of the set of suggestion models based at least on the combination of the metadata, context data and selected subset.
-
公开(公告)号:US09753767B2
公开(公告)日:2017-09-05
申请号:US15431573
申请日:2017-02-13
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Ronald Agresta , Roger Jay Barney , Willem Abraham Hazejager
CPC classification number: G06F9/4843 , G06F9/4806 , G06F9/4881 , G06F17/30424 , G06F17/30445
Abstract: An apparatus may include a processor and storage to store instructions that cause the processor to perform operations including: generate a current data set model descriptive of a characteristic of a current data set; compare the current data set model to at least one previously generated data set model descriptive of a characteristic of a previously analyzed data set; in response to detection of a match within a similarity threshold: retrieve an indication from a correlation database of an action previously performed on a previously analyzed data set; select a computer language based on node data descriptive of characteristics of a node device execution environment; generate node instructions in the selected computer language and based on the current data set model to cause the node device to perform the previously performed action on a portion of the current data set; and transmit the node instructions to the node device.
-
公开(公告)号:US11341414B2
公开(公告)日:2022-05-24
申请号:US17165226
申请日:2021-02-02
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Roger Jay Barney , John P. Trawinski
Abstract: An apparatus includes processor(s) to: receive a request for a data catalog; in response to the request specifying a structural feature, analyze metadata of multiple data sets for an indication of including it, and to retrieve an indicated degree of certainty of detecting it for data sets including it; in response to the request specifying a contextual aspect, analyze context data of the multiple data sets for an indication of being subject to it, and to retrieve an indicated degree of certainty concerning it for data sets subject to it; selectively include each data set in the data catalog based on the request specifying a structural feature and/or a contextual aspect, and whether each data set meets what is specified; for each data set in the data catalog, generate a score indicative of the likelihood of meeting what is specified; and transmit the data catalog to the requesting device.
-
公开(公告)号:US20210158171A1
公开(公告)日:2021-05-27
申请号:US17165226
申请日:2021-02-02
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Roger Jay Barney , John P. Trawinski
Abstract: An apparatus includes processor(s) to: receive a request for a data catalog; in response to the request specifying a structural feature, analyze metadata of multiple data sets for an indication of including it, and to retrieve an indicated degree of certainty of detecting it for data sets including it; in response to the request specifying a contextual aspect, analyze context data of the multiple data sets for an indication of being subject to it, and to retrieve an indicated degree of certainty concerning it for data sets subject to it; selectively include each data set in the data catalog based on the request specifying a structural feature and/or a contextual aspect, and whether each data set meets what is specified; for each data set in the data catalog, generate a score indicative of the likelihood of meeting what is specified; and transmit the data catalog to the requesting device.
-
公开(公告)号:US20170153914A1
公开(公告)日:2017-06-01
申请号:US15431573
申请日:2017-02-13
Applicant: SAS Institute Inc.
Inventor: Nancy Anne Rausch , Ronald Agresta , Roger Jay Barney , Willem Abraham Hazejager
CPC classification number: G06F9/4843 , G06F9/4806 , G06F9/4881 , G06F17/30424 , G06F17/30445
Abstract: An apparatus may include a processor and storage to store instructions that cause the processor to perform operations including: generate a current data set model descriptive of a characteristic of a current data set; compare the current data set model to at least one previously generated data set model descriptive of a characteristic of a previously analyzed data set; in response to detection of a match within a similarity threshold: retrieve an indication from a correlation database of an action previously performed on a previously analyzed data set; select a computer language based on node data descriptive of characteristics of a node device execution environment; generate node instructions in the selected computer language and based on the current data set model to cause the node device to perform the previously performed action on a portion of the current data set; and transmit the node instructions to the node device.
-
-
-
-
-