Invention Grant
- Patent Title: Method and apparatus of text classification
- Patent Title (中): 文本分类方法和装置
-
Application No.: US12996658Application Date: 2010-09-03
-
Publication No.: US09208220B2Publication Date: 2015-12-08
- Inventor: Xiang Sun
- Applicant: Xiang Sun
- Applicant Address: KY Grand Cayman
- Assignee: Alibaba Group Holding Limited
- Current Assignee: Alibaba Group Holding Limited
- Current Assignee Address: KY Grand Cayman
- Agency: Lee & Hayes, PLLC
- Priority: CN201010104512 20100201
- International Application: PCT/US2010/047868 WO 20100903
- International Announcement: WO2011/093925 WO 20110804
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
The present disclosure provides a technique of text categorization to simplify and optimize the classification. In one aspect, a method parses a given text into one or more words; determines a word vector in a spherical space model for one of the one or more words, a number of dimensions of the spherical space being equal to a number of categories, each category corresponding to a spherical space category vector; for each category, determines a distance between a sum of word vectors of the one or more words and the respective category vector; and classifies the text into one or more categories with the shortest distance. The present disclosure also provides an apparatus used to implement the method.
Public/Granted literature
- US20110213777A1 Method and Apparatus of Text Classification Public/Granted day:2011-09-01
Information query