Generating machine renderable representations of forms using machine learning
Abstract:
A method may include clustering form elements into line objects and columns of a table of a structured representation by applying a trained multi-dimensional clustering model to spatial coordinates of the form elements, and assigning a table header line type to a table header line object of the line objects based on a spatial coordinate of the table header line object relative to a spatial coordinate of a topmost table data line object of the line objects, and a determination that a number of columns of the table header line object is within a threshold of a number of columns of the topmost table data line object. The topmost table data line object may be assigned a table data line type. The method may further include presenting the structured representation to a user.
Information query
Patent Agency Ranking
0/0