Patent search ap:("SAS Institute Inc.") AND inv:"David James Wheaton"

1.

Granted invention patent
Techniques for extracting contextually structured data from document images (valid)

Publication number: US11087077B2

Publication date: 2021-08-10

Application number: US17089962

Application date: 2020-11-05

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; William Robert Nadolski; Heather Michelle GoodyKoontz

IPC: G06K9/00, G06F40/169, G06F16/93, G06F40/284, G06F40/186, G06K9/46, G06F3/0484

Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.

2.

Invention application
TECHNIQUES FOR EXTRACTING CONTEXTUALLY STRUCTURED DATA FROM DOCUMENT IMAGES (valid)

Publication number: US20210200937A1

Publication date: 2021-07-01

Application number: US17089962

Application date: 2020-11-05

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; William Robert Nadolski; Heather Michelle GoodyKoontz

IPC: G06F40/169, G06F16/93, G06F40/284, G06F3/0484, G06F40/186, G06K9/46

Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.

3.

Invention application
TECHNIQUES FOR IMAGE CONTENT EXTRACTION (valid)

Publication number: US20220392047A1

Publication date: 2022-12-08

Application number: US17889801

Application date: 2022-08-17

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; Stuart Dakari Cooke, III; William Robert Nadolski

IPC: G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169, G06F3/04842, G06V10/40

Abstract: Embodiments are directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Some embodiments utilize breakpoints to enable the system to match different documents with internal variations to a common template. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.

4.

Invention application
TECHNIQUES FOR EXTRACTING CONTEXTUALLY STRUCTURED DATA FROM DOCUMENT IMAGES (valid)

Publication number: US20210110527A1

Publication date: 2021-04-15

Application number: US17083568

Application date: 2020-10-29

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; William Robert Nadolski; Heather Michelle GoodyKoontz

IPC: G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169

Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.

5.

Granted invention patent
Techniques for image content extraction (valid)

Publication number: US11704785B2

Publication date: 2023-07-18

Application number: US17889801

Application date: 2022-08-17

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; Stuart Dakari Cooke, III; William Robert Nadolski

IPC: G06K9/00, G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169, G06F3/04842, G06V10/40, G06K9/62, G06V30/10, G06V30/24, G06V30/418

CPC classification number: G06T7/0002, G06F3/04842, G06F16/81, G06F16/93, G06F40/169, G06F40/186, G06F40/284, G06V10/40, G06K9/6253, G06K9/6276, G06T2207/30168, G06T2207/30176, G06V30/10, G06V30/248, G06V30/418

Abstract: Embodiments are directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Some embodiments utilize breakpoints to enable the system to match different documents with internal variations to a common template. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.

6.

Granted invention patent
Techniques for image content extraction (valid)

Publication number: US11443416B2

Publication date: 2022-09-13

Application number: US17397470

Application date: 2021-08-09

Applicant: SAS Institute Inc.

Inventor: Yi Liao; Charles Franklin Board; William Robert Nadolski; David James Wheaton; Heather Michelle Goodykoontz; Adheesha Sanjuaya Arangala; Karthik Nakkeeran

IPC: G06K9/00, G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169, G06F3/04842, G06V10/40, G06K9/62, G06V30/10, G06V30/24, G06V30/418

Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.

7.

Invention application
TECHNIQUES FOR IMAGE CONTENT EXTRACTION (valid)

Publication number: US20210366099A1

Publication date: 2021-11-25

Application number: US17397470

Application date: 2021-08-09

Applicant: SAS Institute Inc.

Inventor: Yi Liao; Charles Franklin Board; William Robert Nadolski; David James Wheaton; Heather Michelle Goodykoontz; Adheesha Sanjuaya Arangala; Karthik Nakkeeran

IPC: G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169, G06F3/0484, G06K9/46

Abstract: Various embodiments are generally directed to techniques for image content extraction. Some embodiments include extracting contextually structured data from document images, such as by automatically identifying document layout, document data, document metadata, and/or correlations therebetween in a document image, for instance. Several embodiments include extracting contextually structured data from table images, such as gridded and non-gridded tables. For example, the contents of cells may be extracted from a table image along with structural context including the corresponding row and column information. Many embodiments are directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. Several embodiments are directed to automatically identifying and associating document metadata with corresponding document data in a document image to generate a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation may be used to generate a template for the template database.

8.

Granted invention patent
Techniques for extracting contextually structured data from document images (valid)

Publication number: US11049235B2

Publication date: 2021-06-29

Application number: US17083568

Application date: 2020-10-29

Applicant: SAS Institute Inc.

Inventor: David James Wheaton; William Robert Nadolski; Heather Michelle GoodyKoontz

IPC: G06K9/00, G06T7/00, G06F16/81, G06F16/93, G06F40/284, G06F40/186, G06F40/169, G06K9/68, G06K9/62

Abstract: Embodiments are generally directed to techniques for extracting contextually structured data from document images, such as by automatically identifying document layout, document data, and/or document metadata in a document image, for instance. Many embodiments are particularly directed to generating and utilizing a document template database for automatically extracting document image contents into a contextually structured format. For example, the document template database may include a plurality of templates for identifying/explaining key data elements in various document image formats that can be used to extract contextually structured data from incoming document images with a matching document image format. Several embodiments are particularly directed to automatically identifying and associating document metadata with corresponding document data in a document image, such as for generating a machine-facilitated annotation of the document image. In some embodiments, the machine-facilitated annotation of a document may be used to generate a template for the template database.
