Abstract:
Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
Abstract:
Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
Abstract:
Computer-implemented systems and methods are provided for determining an overall mood score of a document. For example, the document is received from a computer-readable medium. A text segment in a document is identified to be indicative of a mood of the document. The text segment is mapped to a mood scale among a predetermined set of mood scales. A mood weight associated with the mood scale for the text segment is generated. An overall mood score of the document is determined based at least in part on the mood weight.
Abstract:
Computer-implemented systems and methods are provided for determining an overall mood score of a document. For example, the document is received from a computer-readable medium. A text segment in a document is identified to be indicative of a mood of the document. The text segment is mapped to a mood scale among a predetermined set of mood scales. A mood weight associated with the mood scale for the text segment is generated. An overall mood score of the document is determined based at least in part on the mood weight.