-
Publication number: GB2582722B
Publication date: 2021-03-03
Application number: GB202009558
Application date: 2018-11-23
Applicant: IBM
Inventor: KEVIN NORTHRUP, CRAIG TRIM, BADR KHAMIS, KARAN SEHGAL, CHANDRASHEKHAR PADOLE, ABISOLA ADENIRAN
Abstract: Methods, computer program products, and systems are presented. The methods include, for instance: obtaining a document image with objects and identifying microblocks corresponding to each object; and analyzing the position of a microblock for collinearity with another microblock, based on their respective positional characteristics and adjustable collinearity parameters. Collinear microblocks are identified as a macroblock, and computational data of a key-value pair is created from the macroblock. A heuristic confidence level is associated with the key-value pair. In addition, based on data cluster formation, a table may be classified and its data extracted.
-
Publication number: GB2582722A
Publication date: 2020-09-30
Application number: GB202009558
Application date: 2018-11-23
Applicant: IBM
Inventor: KEVIN NORTHRUP, CRAIG TRIM, BADR KHAMIS, KARAN SEHGAL, CHANDRASHEKHAR PADOLE, ABISOLA ADENIRAN
Abstract: Methods, computer program products, and systems are presented. The methods include, for instance: obtaining a document image with objects and identifying microblocks corresponding to each object; and analyzing the position of a microblock for collinearity with another microblock, based on their respective positional characteristics and adjustable collinearity parameters. Collinear microblocks are identified as a macroblock, and computational data of a key-value pair is created from the macroblock. A heuristic confidence level is associated with the key-value pair. In addition, based on data cluster formation, a table may be classified and its data extracted.
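Illustrative sketch (not part of the patent record): the abstract does not disclose how collinearity is tested or how the confidence level is computed, so the Python below is only one plausible reading. The `Microblock` fields, the `y_tolerance` and `max_gap` parameters (standing in for the "adjustable collinearity parameters"), and the confidence values 0.9 and 0.6 are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Microblock:
    """Hypothetical bounding box for one text object found in the image."""
    text: str
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float


def is_collinear(a, b, y_tolerance=5.0, max_gap=50.0):
    """Assumed test: vertical centers are close and the horizontal gap is small."""
    center_a = a.y + a.height / 2
    center_b = b.y + b.height / 2
    left, right = (a, b) if a.x <= b.x else (b, a)
    gap = right.x - (left.x + left.width)
    return abs(center_a - center_b) <= y_tolerance and gap <= max_gap


def build_macroblocks(blocks, y_tolerance=5.0, max_gap=50.0):
    """Greedily group microblocks that are collinear with a block already in a group."""
    macroblocks = []
    for block in sorted(blocks, key=lambda b: (b.y, b.x)):
        for group in macroblocks:
            if any(is_collinear(block, m, y_tolerance, max_gap) for m in group):
                group.append(block)
                break
        else:
            macroblocks.append([block])
    # Order each macroblock left to right so the key candidate comes first.
    return [sorted(group, key=lambda b: b.x) for group in macroblocks]


def to_key_value(macroblock):
    """Treat the leftmost microblock as the key and the rest as the value.

    The confidence is a made-up heuristic: higher when the key text ends
    with a colon, which often marks a field label.
    """
    if len(macroblock) < 2:
        return None
    raw_key = macroblock[0].text.strip()
    confidence = 0.9 if raw_key.endswith(":") else 0.6
    key = raw_key.rstrip(":").strip()
    value = " ".join(m.text for m in macroblock[1:])
    return key, value, confidence
```

Loosening `y_tolerance` or `max_gap` makes the grouping more tolerant of skewed scans at the cost of merging unrelated text, which is presumably why the abstract describes the collinearity parameters as adjustable.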
-