Invention Grant
- Patent Title: Method for determining output data for a plurality of text documents
- Application No.: US16385054
- Application Date: 2019-04-16
- Publication No.: US11263251B2
- Publication Date: 2022-03-01
- Inventor: Mark Buckley
- Applicant: Siemens Aktiengesellschaft
- Applicant Address: DE Munich
- Assignee: Siemens Aktiengesellschaft
- Current Assignee: Siemens Aktiengesellschaft
- Current Assignee Address: DE Munich
- Agency: Schmeiser, Olsen & Watts LLP
- Priority: EP18168202 20180419
- Main IPC: G06F16/35
- IPC: G06F16/35 ; G06F40/30 ; G06F40/284

Abstract:
Provided is a method for determining output data for a plurality of text documents, including the steps of: providing a feature matrix as input data, wherein the feature matrix includes information about frequencies of a plurality of features within the plurality of text documents; clustering the feature matrix using a clustering algorithm into at least one clustering matrix, wherein the at least one clustering matrix includes information about the cluster membership of each document of the plurality of documents or each feature of the plurality of features; assigning at least one score to each feature of the plurality of features based on the at least one clustering matrix; ranking the plurality of features based on their assigned scores; and outputting the ranked features as output data. A corresponding computer program product and system are also provided.
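
The steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which clustering algorithm or scoring rule is used, so everything specific here is an assumption made for illustration: whitespace-separated words as features, plain k-means with a deterministic farthest-point initialisation as the clustering algorithm, a one-hot document-by-cluster membership matrix as the clustering matrix, and a hypothetical score equal to a feature's frequency in its dominant cluster minus its frequency in all other clusters. The function names and the sample documents are likewise invented.

```python
from collections import Counter


def feature_matrix(docs):
    """Return (vocab, X) where X[d][f] is the count of feature f in document d."""
    counts = [Counter(doc.lower().split()) for doc in docs]
    vocab = sorted(set().union(*counts))
    return vocab, [[c[w] for w in vocab] for c in counts]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_labels(rows, k, iters=20):
    """Assumed clustering step: k-means with farthest-point initialisation."""
    centroids = [list(rows[0])]
    while len(centroids) < k:
        # Next centroid: the row farthest from all centroids chosen so far.
        far = max(rows, key=lambda r: min(sq_dist(r, c) for c in centroids))
        centroids.append(list(far))
    labels = [0] * len(rows)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: sq_dist(r, centroids[c]))
                  for r in rows]
        for c in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members)
                                for col in zip(*members)]
    return labels


def rank_features(docs, k=2):
    """Cluster documents, score each feature, and return features ranked by score."""
    vocab, X = feature_matrix(docs)
    labels = kmeans_labels(X, k)
    # Clustering matrix: one-hot cluster membership per document.
    membership = [[1 if lab == c else 0 for c in range(k)] for lab in labels]
    scored = []
    for f, word in enumerate(vocab):
        per_cluster = [sum(X[d][f] * membership[d][c] for d in range(len(X)))
                       for c in range(k)]
        dominant = max(per_cluster)
        # Hypothetical score: mass in the dominant cluster minus mass elsewhere.
        scored.append((word, dominant - (sum(per_cluster) - dominant)))
    # Highest score first; ties broken alphabetically for a stable ranking.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


docs = [
    "pump pressure pump failure report",
    "pump failure valve report",
    "invoice payment invoice report",
    "payment invoice overdue report",
]
ranked = rank_features(docs, k=2)
for word, score in ranked:
    print(word, score)
```

In this toy input the word "report" is the most frequent feature overall, yet it is ranked last because it is spread evenly across both clusters, while "invoice" and "pump" rank first because they are concentrated in a single cluster. A different clustering algorithm or scoring rule would fit the abstract equally well.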
Public/Granted literature
- US20190325026A1 METHOD FOR DETERMINING OUTPUT DATA FOR A PLURALITY OF TEXT DOCUMENTS Public/Granted day: 2019-10-24