Electronic document content classification and document type determination
Abstract:
A system and method include receiving an electronic document whose content is divided into components. For each component, the content of the component is compared with component content characteristics of a classification schema associated with a document type, and indications of the component content characteristics that are included in the component are stored, in an electronic data storage, in a record associated with the component. Document content characteristics of the classification schema are likewise compared with the content of the electronic document, and indications of the document content characteristics that are included in the electronic document are stored, in the electronic data storage, in a record for the electronic document. The electronic document is identified as the document type based on the content characteristics indicated in the records meeting a document type criterion.
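The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not say how characteristics are tested, how records are stored, or what the document type criterion is, so everything below is an assumption made for illustration: the names `Schema` and `classify`, the use of simple predicate functions as characteristics, in-memory lists standing in for the electronic data storage, and a criterion defined as a minimum number of distinct characteristics found.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# A characteristic is modeled here as a named predicate over text (an assumption).
Check = Callable[[str], bool]


@dataclass
class Schema:
    """Hypothetical classification schema for one document type."""
    doc_type: str
    component_checks: Dict[str, Check]  # component content characteristics
    document_checks: Dict[str, Check]   # document content characteristics
    min_hits: int                       # assumed criterion: distinct characteristics required


def classify(components: List[str], schema: Schema) -> Tuple[List[List[str]], List[str], bool]:
    """Build one record per component and one for the document, then apply the criterion."""
    # One record per component: names of the component characteristics it contains.
    component_records = [
        [name for name, check in schema.component_checks.items() if check(text)]
        for text in components
    ]

    # One record for the whole document: names of the document characteristics it contains.
    full_text = "\n".join(components)
    document_record = [
        name for name, check in schema.document_checks.items() if check(full_text)
    ]

    # Assumed criterion: enough distinct characteristics across all records.
    found = set(document_record)
    for record in component_records:
        found.update(record)
    return component_records, document_record, len(found) >= schema.min_hits


# Hypothetical schema and document, invented for this example.
invoice_schema = Schema(
    doc_type="invoice",
    component_checks={
        "has_invoice_number": lambda t: "invoice no" in t.lower(),
        "has_total": lambda t: "total due" in t.lower(),
    },
    document_checks={
        "mentions_payment_terms": lambda t: "net 30" in t.lower(),
    },
    min_hits=2,
)

components = ["Invoice No. 1042", "Widgets x3", "Total due: $45.00, net 30"]
component_records, document_record, is_invoice = classify(components, invoice_schema)
print(component_records, document_record, is_invoice)
```

Keeping the per-component records separate from the document record mirrors the two kinds of records named in the abstract. A real criterion could weight characteristics or require specific ones rather than counting them.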