Invention Grant
- Patent Title: Clustering of text units using dimensionality reduction of multi-dimensional arrays
- Patent Title (中): 使用多维阵列的维数降低的文本单元的聚类
-
Application No.: US13656315Application Date: 2012-10-19
-
Publication No.: US09141882B1Publication Date: 2015-09-22
- Inventor: Baoqiang Cao , T. Ryan Fitz-Gibbon , Lucas Forehand , Ryan McHale , Bradley Burke
- Applicant: Networked Insights, LLC
- Applicant Address: US WI Madison
- Assignee: NETWORKED INSIGHTS, LLC
- Current Assignee: NETWORKED INSIGHTS, LLC
- Current Assignee Address: US WI Madison
- Agency: Foley & Lardner LLP
- Main IPC: G06E1/00
- IPC: G06E1/00 ; G06E3/00 ; G06F15/18 ; G06G7/00 ; G06K9/62 ; G06F17/30

Abstract:
Methods, systems, and apparatuses, including computer programs encoded on computer-readable media, for tokenizing n-grams from a plurality of text units. A multi-dimensional array is created having a plurality of dimensions based upon the plurality of text units and the n-grams from the plurality of text units. The multi-dimensional array is normalized and the dimensionality of the multi-dimensional array is reduced. The reduced dimensionality multi-dimensional array is clustered to generate a plurality of clusters that each cluster includes one or more of the plurality of text units.
Information query