Invention Grant
- Patent Title: Dynamic document clustering and keyword extraction
-
Application No.: US17157487Application Date: 2021-01-25
-
Publication No.: US11243990B2Publication Date: 2022-02-08
- Inventor: Yehoshua Enuka , Nimrod Vax , Eyal Sacharov , Itamar Apel , David Moyal
- Applicant: BigID Inc.
- Applicant Address: US NY New York
- Assignee: BigID Inc.
- Current Assignee: BigID Inc.
- Current Assignee Address: US NY New York
- Agency: Zeller IP Group, PLLC
- Agent Kyle M. Zeller
- Main IPC: G06F16/35
- IPC: G06F16/35 ; G06F16/31 ; G06K9/00 ; G06F40/205 ; G06F16/13 ; G06K9/62

Abstract:
Systems, methods and apparatuses are disclosed to cluster a plurality of documents located in any number of local and/or remote systems and applications. Preprocessed text is generated for each document, and a hash and a feature vector are determined based on the preprocessed text. A set of clusters is retrieved, wherein each cluster is associated with a hash list and a cumulative feature vector. Each of the documents may then be associated with a cluster by comparing the hash of the document to the hash lists of the clusters and/or by determining similarities between the feature vector of the document and the cumulative feature vectors of the clusters.
Public/Granted literature
- US20210150204A1 Dynamic Document Clustering and Keyword Extraction Public/Granted day:2021-05-20
Information query