Invention Grant
- Patent Title: Document structure identification using post-processing error correction
-
Application No.: US16655365Application Date: 2019-10-17
-
Publication No.: US11321559B2Publication Date: 2022-05-03
- Inventor: Ashutosh Mehra , Md Nadeem Akhtar , Pranav Kumar
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: Adobe Inc.
- Current Assignee: Adobe Inc.
- Current Assignee Address: US CA San Jose
- Agency: Finch & Maloney PLLC
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06N20/00

Abstract:
Techniques are disclosed for identifying document structural elements and correcting errors in the classification and/or location of the identified structural elements. An example method includes determining location and classification for a structural element on a page of the document using a machine learning (ML) model; determining one or more errors in the location and/or classification for the structural element; and correcting each instance of the one or more errors using other content in the document (e.g., content spatially adjacent to the corresponding structural element on the page of the document). The method may further include storing the document and the location and classification (as corrected), and/or generating a structural map of the page of the document based on the location and classification (as corrected). The use of the document content to correct errors greatly enhances the agreement between the identified structural elements and the original document.
Public/Granted literature
- US20210117667A1 DOCUMENT STRUCTURE IDENTIFICATION USING POST-PROCESSING ERROR CORRECTION Public/Granted day:2021-04-22
Information query