Unstructured data clustering of information technology service delivery actions
Abstract:
Systems, methods, and computer program products relating to clustering unstructured data. A set of unstructured documents is tokenized to produce a plurality of tokens. A frequency at which terms appear in the plurality of tokens is analyzed, to generate a vocabulary of terms. A vocabulary indices matrix is generated based on the generated vocabulary of terms. The matrix relates to the set of unstructured documents. A plurality of rows in the vocabulary indices matrix are matched to generate a plurality of clusters for the set of unstructured documents.
Information query
Patent Agency Ranking
0/0