Invention Grant
- Patent Title: Systems and methods for phrase clustering
- Patent Title (中): 短语聚类的系统和方法
- Application No.: US12946896
- Application Date: 2010-11-16
- Publication No.: US08751496B2
- Publication Date: 2014-06-10
- Inventor: Indrajit Bhattacharya, Shantanu Ravindra Godbole, Akshit Sharma
- Applicant: Indrajit Bhattacharya, Shantanu Ravindra Godbole, Akshit Sharma
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Ference & Associates LLC
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Systems and associated methods for enhanced concept understanding in large document collections through phrase clustering are described. Embodiments take as input an initial set of phrases and estimate centroids using a clustering process. Embodiments then generate new phrases around each of the current centroids using the current phrases. These new phrases are added to the current set, and the clustering process is iterated. Upon convergence, embodiments finalize clusters based on phrases of any given length.
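
The loop described in the abstract (cluster the current phrases, generate new phrases around each centroid, add them, repeat until nothing changes) can be illustrated with a minimal Python sketch. The abstract does not state the phrase representation, the similarity measure, the clustering algorithm, or the phrase-generation rule, so everything below is an assumption made for illustration: phrases are represented by per-document occurrence counts, similarity is cosine, clustering is a k-means-style pass with farthest-first seeding, and new phrases are formed by joining two phrases from the same cluster when the joined phrase occurs at least `min_count` times in the corpus. All function and parameter names are hypothetical and are not taken from the patent.

```python
from collections import defaultdict
from math import sqrt


def occurrences(phrase, docs):
    """Sparse vector {doc_index: count} for a phrase given as a tuple of tokens."""
    n, vec = len(phrase), {}
    for i, doc in enumerate(docs):
        hits = sum(1 for j in range(len(doc) - n + 1) if tuple(doc[j:j + n]) == phrase)
        if hits:
            vec[i] = float(hits)
    return vec


def cosine(a, b):
    dot = sum(v * b.get(d, 0.0) for d, v in a.items())
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def estimate_centroids(vectors, k, rounds=10):
    """k-means-style pass (an assumed choice); returns {phrase: cluster_index}."""
    phrases = sorted(vectors)
    if not phrases:
        return {}
    # Assumed seeding: farthest-first, so the initial centroids are dissimilar.
    seeds = [phrases[0]]
    while len(seeds) < min(k, len(phrases)):
        rest = [p for p in phrases if p not in seeds]
        seeds.append(min(rest, key=lambda p: max(cosine(vectors[p], vectors[s]) for s in seeds)))
    centroids = [dict(vectors[s]) for s in seeds]
    assign = {}
    for _ in range(rounds):
        new_assign = {
            p: max(range(len(centroids)), key=lambda c: cosine(vectors[p], centroids[c]))
            for p in phrases
        }
        if new_assign == assign:
            break
        assign = new_assign
        for c in range(len(centroids)):
            members = [p for p in phrases if assign[p] == c]
            if members:
                mean = defaultdict(float)
                for p in members:
                    for d, v in vectors[p].items():
                        mean[d] += v / len(members)
                centroids[c] = dict(mean)
    return assign


def generate_phrases(assign, vectors, docs, min_count):
    """Assumed rule: join two same-cluster phrases if the result is frequent enough."""
    by_cluster = defaultdict(list)
    for phrase, c in assign.items():
        by_cluster[c].append(phrase)
    new = {}
    for members in by_cluster.values():
        for a in members:
            for b in members:
                cand = a + b
                if cand in vectors or cand in new:
                    continue
                vec = occurrences(cand, docs)
                if sum(vec.values()) >= min_count:
                    new[cand] = vec
    return new


def cluster_phrases(docs, k=2, min_count=2, max_iters=10):
    # Initial phrase set: single words that meet the frequency threshold.
    vectors = {}
    for word in {(w,) for doc in docs for w in doc}:
        vec = occurrences(word, docs)
        if sum(vec.values()) >= min_count:
            vectors[word] = vec
    for _ in range(max_iters):
        assign = estimate_centroids(vectors, k)
        new = generate_phrases(assign, vectors, docs, min_count)
        if not new:  # convergence: no new phrases were generated
            break
        vectors.update(new)
    assign = estimate_centroids(vectors, k)  # final clusters over all phrase lengths
    clusters = defaultdict(list)
    for phrase, c in assign.items():
        clusters[c].append(" ".join(phrase))
    return [sorted(clusters[c]) for c in sorted(clusters)]


docs = [
    "hard disk failure reported".split(),
    "hard disk failure again".split(),
    "credit card payment declined".split(),
    "credit card payment failed".split(),
]
clusters = cluster_phrases(docs)
```

On this toy corpus the sketch starts from single words, grows two-word and then three-word phrases inside each cluster, and stops once no further frequent phrase can be formed, so the final clusters mix phrases of different lengths. The frequency threshold is what keeps the phrase set finite here; a real system would likely need a more careful candidate filter.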
Public/Granted literature
- US20120124044A1 SYSTEMS AND METHODS FOR PHRASE CLUSTERING (Public/Granted day: 2012-05-17)