Invention Grant
- Patent Title: Creation of a summary for a plurality of texts
- Application No.: US15243894
- Application Date: 2016-08-22
- Publication No.: US10430450B2
- Publication Date: 2019-10-01
- Inventor: Yu Gu, Takayuki Kushida, Hiroki Nakano, Yaoping Ruan, Yuji Sugiyama
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Edell, Shapiro & Finnan, LLC
- Agent: Ryan Lewis
- Main IPC: G06F16/34
- IPC: G06F16/34 ; G06F16/35 ; G06F16/901 ; G06F11/30

Abstract:
Creating a summary of a plurality of texts includes tokenizing each of a plurality of texts to obtain tokens; generating a vector space using a first set of vectors having one or more obtained feature scores equal to or larger than a predefined value; executing non-hierarchical clustering using the vector space to generate a first plurality of clusters; choosing a first representative text in each of the plurality of clusters; generating a second set of vectors from each of the arrays generated based on a number of characters included in tokens of the representative texts; executing hierarchical clustering using the second set of vectors to generate a second plurality of clusters; and in response to determining a number of clusters included in the second plurality of clusters, determining a second representative text for each of the clusters included in the second plurality of clusters.
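The two-stage flow described in the abstract can be sketched roughly as follows. This is an illustrative reading, not the claimed implementation: it assumes TF-IDF as the feature score, k-means as the non-hierarchical clustering, single-linkage agglomerative clustering with a distance cutoff as the hierarchical step, and the first member of each final cluster as its representative. All function names, the parameters `k`, `min_score`, and `max_dist`, and the sample texts are hypothetical.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Assumption: tokens are lowercase alphanumeric runs.
    return re.findall(r"[A-Za-z0-9]+", text.lower())

def tfidf_vectors(docs_tokens, min_score):
    # Assumption: TF-IDF stands in for the "feature score".
    n = len(docs_tokens)
    df = Counter(tok for toks in docs_tokens for tok in set(toks))
    scores = []
    for toks in docs_tokens:
        tf = Counter(toks)
        scores.append({t: (c / len(toks)) * math.log(n / df[t]) for t, c in tf.items()})
    # Keep only features whose score reaches the threshold in some text.
    vocab = sorted({t for s in scores for t, v in s.items() if v >= min_score})
    return [[s.get(t, 0.0) for t in vocab] for s in scores]

def dist(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(vectors, k, iters=20):
    # Deterministic farthest-point seeding, then plain Lloyd iterations.
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(max(vectors, key=lambda v: min(dist(v, c) for c in centroids)))
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: dist(v, centroids[j])) for v in vectors]
        for j in range(k):
            members = [v for v, l in zip(vectors, labels) if l == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids

def representatives(vectors, labels, centroids):
    # First representative: the member closest to its cluster centroid.
    reps = []
    for j, c in enumerate(centroids):
        members = [i for i, l in enumerate(labels) if l == j]
        if members:
            reps.append(min(members, key=lambda i: dist(vectors[i], c)))
    return reps

def length_vector(tokens, width):
    # Second-stage vector: character count per token, zero-padded.
    lens = [len(t) for t in tokens][:width]
    return lens + [0] * (width - len(lens))

def agglomerate(vectors, max_dist):
    # Single-linkage merging until the closest pair exceeds max_dist.
    clusters = [[i] for i in range(len(vectors))]
    def link(a, b):
        return min(dist(vectors[i], vectors[j]) for i in a for j in b)
    while len(clusters) > 1:
        d, x, y = min((link(clusters[x], clusters[y]), x, y)
                      for x in range(len(clusters))
                      for y in range(x + 1, len(clusters)))
        if d > max_dist:
            break
        clusters[x] += clusters.pop(y)
    return clusters

def summarize(texts, k, min_score=0.05, max_dist=2.0):
    tokens = [tokenize(t) for t in texts]
    vecs = tfidf_vectors(tokens, min_score)
    labels, centroids = kmeans(vecs, k)
    reps = representatives(vecs, labels, centroids)
    width = max(len(tokens[i]) for i in reps)
    shapes = [length_vector(tokens[i], width) for i in reps]
    groups = agglomerate(shapes, max_dist)
    # Simplification: the first member of each group is its representative.
    return [texts[reps[g[0]]] for g in groups]

texts = [
    "disk sda1 usage 91 percent",
    "disk sdb2 usage 87 percent",
    "user alice login failed",
    "user bobby login failed",
]
for line in summarize(texts, k=2):
    print(line)
```

In this sketch the second stage compares only the shape of each representative text (how long each token is), so messages that differ in wording but share a layout can still be merged, while `max_dist` determines how many final clusters remain.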
Public/Granted literature
- US20180052918A1 CREATION OF A SUMMARY FOR A PLURALITY OF TEXTS Public/Granted day: 2018-02-22