Systems, methods, and graphical user interfaces for taxonomy-based classification of unlabeled structured datasets

    Publication number: US12277144B2

    Publication date: 2025-04-15

    Application number: US18221684

    Filing date: 2023-07-13

    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.

    SYSTEMS, METHODS, AND GRAPHICAL USER INTERFACES FOR TAXONOMY-BASED CLASSIFICATION OF UNLABELED STRUCTURED DATASETS

    Publication number: US20240028621A1

    Publication date: 2024-01-25

    Application number: US18221684

    Filing date: 2023-07-13

    CPC classification number: G06F16/287

    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.

    Systems, methods, and graphical user interfaces for taxonomy-based classification of unlabeled structured datasets

    Publication number: US11841851B1

    Publication date: 2023-12-12

    Application number: US18124299

    Filing date: 2023-03-21

    CPC classification number: G06F16/2428 G06F16/287 G06F40/284

    Abstract: A system, method, and computer-program product includes identifying a target hierarchical taxonomy that includes a plurality of distinct hierarchical taxonomy categories, extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories, computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens, computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus, constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters, converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier, and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.

    Systems, methods, and graphical user interfaces for taxonomy-based classification of unlabeled structured datasets

    Publication number: US11809460B1

    Publication date: 2023-11-07

    Application number: US18221695

    Filing date: 2023-07-13

    CPC classification number: G06F16/287

    Abstract: A computer-implemented system includes identifying a target hierarchical taxonomy comprising a plurality of distinct hierarchical taxonomy categories; extracting a plurality of distinct taxonomy tokens from the plurality of distinct hierarchical taxonomy categories; computing a taxonomy vector corpus based on the plurality of distinct taxonomy tokens; computing a plurality of distinct taxonomy clusters based on an input of the taxonomy vector corpus; constructing a hierarchical taxonomy classifier based on the plurality of distinct taxonomy clusters; converting a volume of unlabeled structured datasets to a plurality of distinct corpora of taxonomy-labeled structured datasets based on the hierarchical taxonomy classifier; and outputting at least one corpus of taxonomy-labeled structured datasets of the plurality of distinct corpora of taxonomy-labeled structured datasets based on an input of a data classification query.
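
The steps recited in the abstracts above (tokens, vector corpus, clusters, classifier, labeled corpora, query) can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the patents: the sample taxonomy, the bag-of-words vectors, the greedy cosine-threshold clustering, and the names `build_classifier`, `label_datasets`, and `query` are all hypothetical stand-ins for whatever the claimed embodiments actually use.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical toy taxonomy; each category is a root-to-leaf path.
TAXONOMY = [
    "Electronics > Phones > Smartphones",
    "Electronics > Computers > Laptops",
    "Apparel > Footwear > Running Shoes",
    "Apparel > Outerwear > Rain Jackets",
]

def tokenize(text):
    """Extract lowercase alphabetic tokens (assumed tokenization rule)."""
    return re.findall(r"[a-z]+", text.lower())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def cluster(vectors, threshold):
    """Greedy clustering: join the first cluster whose seed vector is similar enough."""
    clusters = []
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters

def build_classifier(taxonomy, threshold=0.2):
    """Two-stage classifier: choose a cluster first, then a category inside it."""
    vectors = [Counter(tokenize(c)) for c in taxonomy]       # taxonomy vector corpus
    clusters = cluster(vectors, threshold)                    # taxonomy clusters
    centroids = [sum((vectors[i] for i in m), Counter()) for m in clusters]

    def classify(record):
        # A structured record is assumed to be a dict of field -> value.
        rec = Counter(tokenize(" ".join(str(v) for v in record.values())))
        best = max(range(len(clusters)), key=lambda k: cosine(rec, centroids[k]))
        leaf = max(clusters[best], key=lambda i: cosine(rec, vectors[i]))
        return taxonomy[leaf]

    return classify

def label_datasets(records, classify):
    """Group unlabeled records into one corpus per taxonomy label."""
    corpora = defaultdict(list)
    for record in records:
        corpora[classify(record)].append(record)
    return dict(corpora)

def query(corpora, prefix):
    """Return all records whose label starts with the queried taxonomy path."""
    return [r for label, rs in corpora.items() if label.startswith(prefix) for r in rs]

records = [
    {"name": "Trail running shoes", "brand": "Acme"},
    {"name": "Gaming laptops", "sku": 1234},
]
corpora = label_datasets(records, build_classifier(TAXONOMY))
apparel = query(corpora, "Apparel")
```

In this sketch a record that shares no token with any category still receives a label (the first category of the first cluster), so a practical variant would likely add a minimum-similarity cutoff and an "unclassified" corpus.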
