Invention Grant
- Patent Title: Automatic selection of templates for extraction of data from electronic documents
-
Application No.: US18192170Application Date: 2023-03-29
-
Publication No.: US11887395B2Publication Date: 2024-01-30
- Inventor: Hanieh Borhanazad , Jimmy Chandra , Jey Jeyaramanan , Thuwaragan Sundaramoorthy , Mark Burch
- Applicant: Coupa Software Incorporated
- Applicant Address: US CA San Mateo
- Assignee: Coupa Software Incorporated
- Current Assignee: Coupa Software Incorporated
- Current Assignee Address: US CA San Mateo
- Agency: Baker Botts L.L.P.
- Main IPC: G06V30/418
- IPC: G06V30/418 ; G06F40/186 ; G06V30/412

Abstract:
A computer-implemented method for automatic template selection for extracting data from an input electronic document is provided. The method includes receiving a first set of candidate templates and an input electronic document. For each candidate template, a template similarity ratio value is calculated that represents a similarity of the candidate template to the input electronic document. The first set of candidate templates are ranked according to the template similarity ratios and then matched to the input electronic document resulting in generating a normalized similarity score for each particular candidate from among the candidate templates. Differences in normalized similarity scores of successive pairs of the candidate templates is determined and a breaking point is established. A second set of candidate templates is formed by selecting candidate templates that are ranked above the breaking point. Data from the input electronic document is extracted using the second set of candidate templates.
Public/Granted literature
- US20230237829A1 AUTOMATIC SELECTION OF TEMPLATES FOR EXTRACTION OF DATA FROM ELECTRONIC DOCUMENTS Public/Granted day:2023-07-27
Information query