Invention Grant
- Patent Title: Identifying key-value pairs in documents
- Application No.: US16802864
- Application Date: 2020-02-27
- Publication No.: US11288719B2
- Publication Date: 2022-03-29
- Inventor: Yang Xu, Jiang Wang, Shengyang Dai
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Honigman LLP
- Agent: Brett A. Krueger
- Main IPC: G06Q30/04
- IPC: G06Q30/04; G06K9/00; G06N3/08

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for converting unstructured documents to structured key-value pairs. In one aspect, a method comprises: providing an image of a document to a detection model, wherein: the detection model is configured to process the image to generate an output that defines one or more bounding boxes generated for the image; and each bounding box generated for the image is predicted to enclose a key-value pair comprising key textual data and value textual data, wherein the key textual data defines a label that characterizes the value textual data; and for each of the one or more bounding boxes generated for the image: identifying textual data enclosed by the bounding box using an optical character recognition technique; and determining whether the textual data enclosed by the bounding box defines a key-value pair.
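The flow in the abstract (detect bounding boxes, run OCR on each box, then decide whether the recognized text is a key-value pair) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names `extract_key_value_pairs`, `parse_key_value`, `detect_boxes`, `ocr`, and `valid_keys` are all hypothetical, the detection model and OCR engine are passed in as plain callables, and the "Key: value" colon rule with a fixed set of known keys is an assumed stand-in for whatever test the method actually applies.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable, List, Optional, Set, Tuple

# (left, top, right, bottom) in pixels; a hypothetical convention.
BoundingBox = Tuple[int, int, int, int]


@dataclass
class KeyValuePair:
    key: str    # label that characterizes the value, e.g. "Total"
    value: str  # the labeled data, e.g. "$40.00"


def parse_key_value(text: str, valid_keys: Set[str]) -> Optional[KeyValuePair]:
    """Return a KeyValuePair if `text` looks like "Key: value", else None.

    Assumption: the key precedes the first colon and must appear
    (case-insensitively) in `valid_keys`, which holds lowercase strings.
    """
    key, sep, value = text.partition(":")
    key, value = key.strip(), value.strip()
    if sep and value and key.lower() in valid_keys:
        return KeyValuePair(key, value)
    return None


def extract_key_value_pairs(
    image: Any,
    detect_boxes: Callable[[Any], Iterable[BoundingBox]],
    ocr: Callable[[Any, BoundingBox], str],
    valid_keys: Set[str],
) -> List[KeyValuePair]:
    """Run detection, then OCR each box and keep boxes that hold a pair."""
    pairs: List[KeyValuePair] = []
    for box in detect_boxes(image):   # each box is predicted to enclose a pair
        text = ocr(image, box)        # textual data enclosed by the box
        pair = parse_key_value(text, valid_keys)
        if pair is not None:          # discard boxes that fail the check
            pairs.append(pair)
    return pairs
```

Passing the model and OCR engine as callables keeps the sketch independent of any particular library; in practice each would wrap a real detector and a real OCR call. The final check matters because a predicted box is only a candidate: a box whose text is, say, a footer sentence is dropped rather than reported as a pair.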
Public/Granted literature
- US20200273078A1, IDENTIFYING KEY-VALUE PAIRS IN DOCUMENTS, Public/Granted day: 2020-08-27