Invention Grant
- Patent Title: Electronic document content extraction and document type determination
-
Application No.: US15870432Application Date: 2018-01-12
-
Publication No.: US10909309B2Publication Date: 2021-02-02
- Inventor: Ali Taleghani , Kathryn V. J. Sullivan , Kevin Roland Powell , Maria del Mar Gines Marin , Theresa A. Estrada , Tev'n J. Powers , Domenic J. Cipollone , Kylan Nieh , Michael Wilson Daniels
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06F40/166 ; G06Q10/10 ; G06F40/106 ; G06F40/117 ; G06F40/253 ; G06F40/279 ; G06F16/93 ; G06F3/0482 ; G06F16/28 ; G06F16/23 ; G06F16/14 ; G06F16/21 ; G06F16/22 ; G06F16/35 ; G06F16/335 ; G06F16/31 ; G06Q10/06 ; G06Q50/00 ; G06F16/25 ; G06K9/00

Abstract:
A system and method includes receiving content of an electronic document having a document type, the content divided into components each having a unique identifier and selecting an extraction schema based on the document type, the extraction schema having a plurality of data categories. For each of the components, the extraction schema is applied to identify content of the component that corresponds to individual ones of the data categories and saving, with the processor, in an electronic data storage, in a record associated with the component, category metadata indicative of content of the component corresponding to the data categories. In response to obtaining the category metadata for each of the components, applying the extraction schema to the content metadata of each of the components and to the electronic document as a whole to determine document metadata. A user interface displays the document metadata on the user interface.
Public/Granted literature
- US20190138609A1 ELECTRONIC DOCUMENT CONTENT EXTRACTION AND DOCUMENT TYPE DETERMINATION Public/Granted day:2019-05-09
Information query