Invention Grant
- Patent Title: Data classification methods using machine learning techniques
- Application No.: US11752719
- Application Date: 2007-05-23
- Publication No.: US07937345B2
- Publication Date: 2011-05-03
- Inventor: Mauritius A. R. Schmidtler, Roland Borrey
- Applicant: Mauritius A. R. Schmidtler, Roland Borrey
- Applicant Address: US CA Irvine
- Assignee: Kofax, Inc.
- Current Assignee: Kofax, Inc.
- Current Assignee Address: US CA Irvine
- Agency: Zilka-Kotab, PC
- Main IPC: G06F15/18
- IPC: G06F15/18

Abstract:
A method for adapting to a shift in document content according to one embodiment of the present invention includes receiving at least one labeled seed document; receiving unlabeled documents; receiving at least one predetermined cost factor; training a transductive classifier using the at least one predetermined cost factor, the at least one seed document, and the unlabeled documents; classifying the unlabeled documents having a confidence level above a predefined threshold into a plurality of categories using the classifier; reclassifying at least some of the categorized documents into the categories using the classifier; and outputting identifiers of the categorized documents to at least one of a user, another system, and another process. Methods for separating documents are also presented. Methods for document searching are also presented.
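The sequence of steps in the abstract can be illustrated with a rough sketch. It is not the patented method: the abstract does not say how the transductive classifier is built, so the code below substitutes a simple self-training nearest-centroid loop over bag-of-words vectors. The function names (`vectorize`, `cosine`, `classify_with_shift`), the use of the cost factor as a down-weight on unlabeled documents, the normalized-similarity confidence score, and all default values are assumptions made for illustration only.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts for one document (hypothetical helper)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def classify_with_shift(seeds, unlabeled, cost_factor=0.5, threshold=0.6, rounds=3):
    """Self-training stand-in for the transductive classifier in the abstract.

    seeds:      {doc_id: (text, label)}  labeled seed documents
    unlabeled:  {doc_id: text}           unlabeled documents
    Returns {doc_id: label} for documents whose confidence exceeds `threshold`.
    """
    vectors = {doc_id: vectorize(text) for doc_id, text in unlabeled.items()}
    labels = sorted({label for _, label in seeds.values()})
    assigned = {}

    for _ in range(rounds):
        # Train: seeds count fully, pseudo-labeled documents are scaled by
        # cost_factor (an assumed reading of "predetermined cost factor").
        centroids = {label: Counter() for label in labels}
        for text, label in seeds.values():
            for term, count in vectorize(text).items():
                centroids[label][term] += count
        for doc_id, label in assigned.items():
            for term, count in vectors[doc_id].items():
                centroids[label][term] += cost_factor * count

        # Classify, and on later rounds reclassify, every unlabeled document.
        new_assigned = {}
        for doc_id, vec in vectors.items():
            sims = {label: cosine(vec, centroids[label]) for label in labels}
            total = sum(sims.values())
            if total == 0:
                continue  # no shared terms with any category yet
            best = max(labels, key=lambda label: sims[label])
            if sims[best] / total > threshold:
                new_assigned[doc_id] = best

        if new_assigned == assigned:
            break  # assignments are stable
        assigned = new_assigned

    # The returned identifiers correspond to the "outputting identifiers" step.
    return assigned
```

In this sketch, a document that shares no vocabulary with the seeds stays unclassified in the first round, but can be picked up in a later round once terms from confidently classified unlabeled documents have been folded into the centroids. That is one simple way to model adapting to a shift in document content.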
Public/Granted literature
- US20080086433A1 DATA CLASSIFICATION METHODS USING MACHINE LEARNING TECHNIQUES Public/Granted day: 2008-04-10