-
公开(公告)号:US12277144B2
公开(公告)日:2025-04-15
申请号:US18221684
申请日:2023-07-13
Applicant: SAS INSTITUTE INC.
Inventor: Nancy Anne Rausch , Ruth Oluwadamilola Akintunde , Brant Nathan Kay
IPC: G06F40/284 , G06F16/242 , G06F16/28
Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
-
2.
公开(公告)号:US20240028621A1
公开(公告)日:2024-01-25
申请号:US18221684
申请日:2023-07-13
Applicant: SAS INSTITUTE INC.
Inventor: Nancy Anne Rausch , Ruth Oluwadamilola Akintunde , Brant Nathan Kay
IPC: G06F16/28
CPC classification number: G06F16/287
Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
-
公开(公告)号:US12190219B1
公开(公告)日:2025-01-07
申请号:US18824828
申请日:2024-09-04
Applicant: SAS INSTITUTE INC.
Inventor: Joseph O. Nyangon , Ruth Oluwadamilola Akintunde
IPC: G06N20/00
Abstract: A computer-program product, computer-implemented method, and computer-implemented system includes obtaining a raw dataset; executing an outlier filtration process based on obtaining the raw dataset; training a model using a refined outlier-reduced dataset; and predicting, via the trained model, a value of the target entity at a future time.
-
公开(公告)号:US11841851B1
公开(公告)日:2023-12-12
申请号:US18124299
申请日:2023-03-21
Applicant: SAS INSTITUTE INC.
Inventor: Nancy Anne Rausch , Ruth Oluwadamilola Akintunde , Brant Nathan Kay
IPC: G06F16/242 , G06F40/284 , G06F16/28
CPC classification number: G06F16/2428 , G06F16/287 , G06F40/284
Abstract: A system, method, and computer-program product includes identifying a target hierarchical taxonomy that includes a plurality of distinct hierarchical taxonomy categories, extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories, computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens, computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus, constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters, converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier, and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
-
公开(公告)号:US11809460B1
公开(公告)日:2023-11-07
申请号:US18221695
申请日:2023-07-13
Applicant: SAS INSTITUTE INC.
Inventor: Nancy Anne Rausch , Ruth Oluwadamilola Akintunde , Brant Nathan Kay
IPC: G06F16/28
CPC classification number: G06F16/287
Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
-
-
-
-