Invention Grant
- Patent Title: Machine learning data extraction algorithms
-
Application No.: US16156636Application Date: 2018-10-10
-
Publication No.: US10824811B2Publication Date: 2020-11-03
- Inventor: Everaldo Aguiar , Jesper Lind
- Applicant: SAP SE
- Applicant Address: DE Walldorf
- Assignee: SAP SE
- Current Assignee: SAP SE
- Current Assignee Address: DE Walldorf
- Agency: Fountainhead Law Group P.C.
- Main IPC: G06F40/295
- IPC: G06F40/295 ; G06K9/46

Abstract:
Embodiments of the present disclosure pertain to extracting data corresponding to particular data types using machine learning algorithms. In one embodiment, a method includes receiving an image in a backend system, sending the image to an optical character recognition (OCR) component, and in accordance therewith, receiving a plurality of characters recognized in the image. The character set is matched against known values to generate candidate character strings. The character set is processed by one or more machine learning algorithms to produce features. For each candidate character string, the features are then processed by a random forest model to determine a final character string.
Public/Granted literature
- US20200042591A1 MACHINE LEARNING DATA EXTRACTION ALGORITHMS Public/Granted day:2020-02-06
Information query