-
公开(公告)号:GB2605052A
公开(公告)日:2022-09-21
申请号:GB202207244
申请日:2020-10-20
Applicant: IBM
Inventor: PETER ZHONG , YEPES ANTONIO JOSE JIMENO , ELAHEH SHAFIEBAVANI
IPC: G06K9/00
Abstract: A computer-implemented method for using a machine learning model(122) to automatically extract tabular data from an image includes receiving a set of images of tabular data and a set of markup data corresponding respectively to the images of tabular data. The method further includes training a first neural network to delineate the tabular data into cells(440) using the markup data, and training a second neural network to determine content of the cells(440)in the tabular data using the markup data. The method further includes, upon receiving an input image(112) containing a first tabular data without any markup data, generating an electronic output corresponding to the first tabular data by determining the structure of the first tabular data using the first neural network and extracting content of the first tabular using the second neural network.
-
公开(公告)号:GB2603586A
公开(公告)日:2022-08-10
申请号:GB202116546
申请日:2021-11-17
Applicant: IBM
Inventor: PETER ZHONG , ANTONIO JOSE JIMENO YEPES , LENIN MEHEDY
Abstract: Providing document access control based on document component layouts, a processor detects a layout of a document, the layout including one or more components of the document. The components maybe tables, figures, paragraphs and specific document sections. A processor defines an access policy to access the one or more components based on the layout. A processor authorizes a request to access the one or more components based on the access policy and the layout. A processor retrieves the one or more components based on the access policy and the authorized request, where retrieving the one or more components includes displaying them based on the access level. A layout similarity maybe determined between the document and a second document and based on a similarity threshold dynamically applying the access policy to the second document to retrieve a component of a second document based on the access policy.
-