Invention Grant
- Patent Title: Automatic generation of training data for supervised machine learning
- Application No.: US15941815
- Application Date: 2018-03-30
- Publication No.: US11270224B2
- Publication Date: 2022-03-08
- Inventor: Jason James Grams
- Applicant: Konica Minolta Business Solutions U.S.A., Inc.
- Applicant Address: US CA San Mateo
- Assignee: Konica Minolta Business Solutions U.S.A., Inc.
- Current Assignee: Konica Minolta Business Solutions U.S.A., Inc.
- Current Assignee Address: US CA San Mateo
- Agency: Osha Bergman Watanabe & Burton LLP
- Main IPC: G06F16/583
- IPC: G06F16/583 ; G06N20/00 ; G06F16/80 ; G06F16/93

Abstract:
A method is disclosed for training a machine learning model to process electronic documents (EDs). The method includes obtaining a structured ED (SED) from a document repository, where the SED includes a first metadata. The method further generates, based on the SED, a bitmap and a second metadata. The method also determines whether the second metadata is within a predetermined threshold of the first metadata and generates, based on the SED and in response to determining that the second metadata is not within the predetermined threshold of the first metadata, a third metadata. The method additionally determines whether the third metadata is within the predetermined threshold of the first metadata and stores, in response to determining that the third metadata is within the predetermined threshold of the first metadata, a second SED comprising the bitmap and the third metadata.
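The validation loop described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the names `StructuredED`, `distance`, and `build_training_sed` are hypothetical, metadata is modeled as a plain string, and "within a predetermined threshold" is approximated with a string-similarity distance from Python's standard `difflib` module. The abstract does not say what happens when the second metadata already falls within the threshold; the sketch assumes it would be stored the same way.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Callable, Optional, Sequence


@dataclass
class StructuredED:
    """Hypothetical stand-in for a structured electronic document (SED)."""
    content: str
    metadata: str  # the "first metadata" shipped with the document


@dataclass
class TrainingSED:
    """Hypothetical stand-in for the stored second SED: bitmap plus accepted metadata."""
    bitmap: bytes
    metadata: str


def distance(a: str, b: str) -> float:
    """Assumed metric: 0.0 for identical strings, 1.0 for no common characters."""
    return 1.0 - SequenceMatcher(None, a, b).ratio()


def build_training_sed(
    sed: StructuredED,
    render: Callable[[StructuredED], bytes],
    generators: Sequence[Callable[[StructuredED], str]],
    threshold: float,
) -> Optional[TrainingSED]:
    # Generate the bitmap once from the SED.
    bitmap = render(sed)
    # Try each metadata generator in turn (second metadata, third metadata, ...).
    for generate in generators:
        candidate = generate(sed)
        # Accept the first candidate close enough to the first metadata.
        if distance(candidate, sed.metadata) <= threshold:
            return TrainingSED(bitmap=bitmap, metadata=candidate)
    # No candidate was within the threshold, so nothing is stored.
    return None
```

In this sketch the caller supplies `render` and the list of `generators`, so the first generator plays the role of the step producing the second metadata and the next one the step producing the third metadata. A return value of `None` means no generated metadata was close enough to the original to serve as a training label.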
Public/Granted literature
- US20190303800A1 AUTOMATIC GENERATION OF TRAINING DATA FOR SUPERVISED MACHINE LEARNING, Public/Granted day: 2019-10-03