Document coding computer system and method with integrated quality assurance

Invention Grant

US10296850B2 Document coding computer system and method with integrated quality assurance 有权

Please log in to see more content

Patent Title: Document coding computer system and method with integrated quality assurance
Application No.: US14216155

Application Date: 2014-03-17
Publication No.: US10296850B2

Publication Date: 2019-05-21
Inventor: Konstantinos (Constantin) F. Aliferis , Yin Aphinyanaphongs , Alexander Statnikov , Lawrence Fu
Applicant: Konstantinos (Constantin) F. Aliferis , Yin Aphinyanaphongs , Alexander Statnikov , Lawrence Fu
Agent Laurence Weinberger
Main IPC: G06N20/00
IPC: G06N20/00 ; G06N5/02 ; G06Q10/00 ; G06F16/93

Document coding computer system and method with integrated quality assurance

Abstract:

The present invention consists of a computer-implemented system and method for automatically analyzing and coding documents into content categories suitable for high cost, high yield settings where quality and efficiency of classification are essential. A prototypical example application field is legal document predictive coding for purposes of e-discovery and litigation (or litigation readiness) where the automated classification of documents as “responsive” or not must be (a) efficient, (b) accurate, and (c) defensible in court. Many text classification technologies exist but they focus on the relatively simple steps of using a training method on training data, producing a model and testing it on test data. They invariably do not address effectively and simultaneously key quality assurance requirements. The invention applies several data design and validation steps that ensure quality and removal of all possible sources of document classification error or deficiencies. The invention employs multiple classification methods, preprocessing methods, visualization and organization of results, and explanation of models which further enhance predictive quality, but also ease of use of models and user acceptance. The invention can be applied to practically any field where text classification is desired.

Public/Granted literature

US20140279761A1 Document Coding Computer System and Method With Integrated Quality Assurance Public/Granted day:2014-09-18

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N20/00	机器学习