Invention Grant
- Patent Title: Text classification with confidence grading
- Patent Title (中): 具有置信分级的文本分类
-
Application No.: US13084584Application Date: 2011-04-12
-
Publication No.: US08650136B2Publication Date: 2014-02-11
- Inventor: Ram Dayal Goyal
- Applicant: Ram Dayal Goyal
- Applicant Address: US CA San Jose
- Assignee: Ketera Technologies, Inc.
- Current Assignee: Ketera Technologies, Inc.
- Current Assignee Address: US CA San Jose
- Agency: Lipion, Weinberger & Husick
- Agent Ash Tankha
- Priority: IN543/CHE/2011 20110224
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
A computer implemented method and system is provided for classifying a document. A classifier is trained using training documents. A list of first words is obtained from the training documents. A prior probability is determined for each class of multiple classes. Conditional probabilities are calculated for the first words for each class. Confidence thresholds are determined. Confidence grades are defined for the classes using the confidence thresholds. A list of second words is obtained from the document. Conditional probabilities for the list of second words are determined from the calculated conditional probabilities for the list of first words. A posterior probability is calculated for each of the classes and compared with the determined confidence thresholds. Each class is assigned to one of the defined confidence grades based on the comparison. The document is assigned to one of the classes based on the posterior probability and the assigned confidence grades.
Public/Granted literature
- US20120221496A1 Text Classification With Confidence Grading Public/Granted day:2012-08-30
Information query