Invention Grant
- Patent Title: System and method for automated information extraction from scanned documents
-
Application No.: US17472374Application Date: 2021-09-10
-
Publication No.: US12056171B2Publication Date: 2024-08-06
- Inventor: Nihar Ranjan Sahoo , Mahesh Kshirsagar , Kamlesh Mhashilkar , Pushkar Kurhekar , Shivani Nigam , Shriram Pillai
- Applicant: Tata Consultancy Services Limited
- Applicant Address: IN Mumbai
- Assignee: TATA CONSULTANCY SERVICES LIMITED
- Current Assignee: TATA CONSULTANCY SERVICES LIMITED
- Current Assignee Address: IN Mumbai
- Agency: FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER LLP
- Priority: IN 2121001271 2021.01.11
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F16/35 ; G06F40/186 ; G06V30/413 ; G06V30/416 ; G06V30/10

Abstract:
The problem of ever-increasing huge volume of unstructured data, mainly documents, and within that the scanned documents, needs to have a solution to expedite the overall turnaround time in document centric business processing. Majority of these documents often do not strictly follow a specific format or a template, and creating a generic OCR solution, which would work on any kind of document format is needed to enhance overall efficacy of processes. Embodiments of the present disclosure provide system and method that extract tabular and text information from scanned documents. More specifically, method and system are provided to extract user filled tabular data, textual information, selected radio-buttons and checked checkboxes, stamps, barcodes from scanned copies of any filled form with or without any template being pre-defined or without any prior knowledge about format of input forms. The system converts extracted information in a structured form for further for analytics and reporting.
Public/Granted literature
- US20220222284A1 SYSTEM AND METHOD FOR AUTOMATED INFORMATION EXTRACTION FROM SCANNED DOCUMENTS Public/Granted day:2022-07-14
Information query