-
公开(公告)号:US11087077B2
公开(公告)日:2021-08-10
申请号:US17089962
申请日:2020-11-05
Applicant: SAS Institute Inc.
IPC: G06K9/00 , G06F40/169 , G06F16/93 , G06F40/284 , G06F40/186 , G06K9/46 , G06F3/0484
Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
-
公开(公告)号:US20210200937A1
公开(公告)日:2021-07-01
申请号:US17089962
申请日:2020-11-05
Applicant: SAS Institute Inc.
IPC: G06F40/169 , G06F16/93 , G06F40/284 , G06F3/0484 , G06F40/186 , G06K9/46
Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
-
公开(公告)号:US20220392047A1
公开(公告)日:2022-12-08
申请号:US17889801
申请日:2022-08-17
Applicant: SAS Institute Inc.
IPC: G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169 , G06F3/04842 , G06V10/40
Abstract: Embodiments are directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Some embodiments utilize breakpoints to enable the system to match different documents with internal variations to a common template. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
-
公开(公告)号:US20210110527A1
公开(公告)日:2021-04-15
申请号:US17083568
申请日:2020-10-29
Applicant: SAS Institute Inc.
IPC: G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169
Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
-
公开(公告)号:US11704785B2
公开(公告)日:2023-07-18
申请号:US17889801
申请日:2022-08-17
Applicant: SAS Institute Inc.
IPC: G06K9/00 , G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169 , G06F3/04842 , G06V10/40 , G06K9/62 , G06V30/10 , G06V30/24 , G06V30/418
CPC classification number: G06T7/0002 , G06F3/04842 , G06F16/81 , G06F16/93 , G06F40/169 , G06F40/186 , G06F40/284 , G06V10/40 , G06K9/6253 , G06K9/6276 , G06T2207/30168 , G06T2207/30176 , G06V30/10 , G06V30/248 , G06V30/418
Abstract: Embodiments are directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Some embodiments utilize breakpoints to enable the system to match different documents with internal variations to a common template. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
-
公开(公告)号:US11443416B2
公开(公告)日:2022-09-13
申请号:US17397470
申请日:2021-08-09
Applicant: SAS Institute Inc.
Inventor: Yi Liao , Charles Franklin Board , William Robert Nadolski , David James Wheaton , Heather Michelle Goodykoontz , Adheesha Sanjuaya Arangala , Karthik Nakkeeran
IPC: G06K9/00 , G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169 , G06F3/04842 , G06V10/40 , G06K9/62 , G06V30/10 , G06V30/24 , G06V30/418
Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
-
公开(公告)号:US20210366099A1
公开(公告)日:2021-11-25
申请号:US17397470
申请日:2021-08-09
Applicant: SAS Institute Inc.
Inventor: Yi Liao , Charles Franklin Board , William Robert Nadolski , David James Wheaton , Heather Michelle Goodykoontz , Adheesha Sanjuaya Arangala , Karthik Nakkeeran
IPC: G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169 , G06F3/0484 , G06K9/46
Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.
-
公开(公告)号:US11049235B2
公开(公告)日:2021-06-29
申请号:US17083568
申请日:2020-10-29
Applicant: SAS Institute Inc.
IPC: G06K9/00 , G06T7/00 , G06F16/81 , G06F16/93 , G06F40/284 , G06F40/186 , G06F40/169 , G06K9/68 , G06K9/62
Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
-
-
-
-
-
-
-