Invention Grant
- Patent Title: Auto-maintained document classification
- Patent Title (中): 自动维护的文档分类
-
Application No.: US14492914Application Date: 2014-09-22
-
Publication No.: US09195947B2Publication Date: 2015-11-24
- Inventor: Yigal S. Dayan , Gil Fuchs , Josemina M. Magdalen , Irit Maharian , Yariv Tzaban
- Applicant: International Business Machines Corporation
- Applicant Address: KY Grand Cayman
- Assignee: GLOBALFOUNDRIES INC.
- Current Assignee: GLOBALFOUNDRIES INC.
- Current Assignee Address: KY Grand Cayman
- Agency: Edell, Shapiro & Finnan, LLC
- Main IPC: G06N5/04
- IPC: G06N5/04 ; G06K9/62 ; G06N99/00

Abstract:
Machines, systems and methods for maintaining a representative data set in a document classification system, the method comprising: including an initial set of seed representative data in a representative data set (RDS) implemented for a knowledge base (KB), wherein the KB is trained to classify documents provided to a document classification system based on analysis of the representative documents included in the RDS and a set of rules, wherein the seed representative data includes a balanced number of representative data across a plurality of classes; updating the RDS by adding or removing representative data from the RDS based on feedback received about accuracy of classification of one or more documents by the classification system; and retraining the KB, wherein the retraining is performed based on occurrence of one or more events.
Public/Granted literature
- US20150012470A1 AUTO-MAINTAINED DOCUMENT CLASSIFICATION Public/Granted day:2015-01-08
Information query