Document spatial layout feature extraction to simplify template classification

Invention Grant

US11348353B2 Document spatial layout feature extraction to simplify template classification 有权

Please log in to see more content

Patent Title: Document spatial layout feature extraction to simplify template classification
Application No.: US16779462

Application Date: 2020-01-31
Publication No.: US11348353B2

Publication Date: 2022-05-31
Inventor: Michael Sundell , Vibhas Gejji
Applicant: Automation Anywhere, Inc.
Applicant Address: US CA San Jose
Assignee: Automation Anywhere, Inc.
Current Assignee: Automation Anywhere, Inc.
Current Assignee Address: US CA San Jose
Main IPC: G06V30/414
IPC: G06V30/414 ; G06F16/93 ; G06F17/16 ; G06K9/62 ; G06N3/08 ; G06V30/412

Document spatial layout feature extraction to simplify template classification

Abstract:

Image encoded documents are identified by recognizing known objects in each document with an object recognizer. The objects in each page are filtered to remove lower order objects. Known features in the objects are recognized by sequentially organizing each object in each filtered page into a one-dimensional array, where each object is positioned in a corresponding one-dimensional array as a function of location in the corresponding filtered page. The one-dimensional array is then compared to known arrays to classify the image document corresponding to the one-dimensional array.

Public/Granted literature

US20210240975A1 DOCUMENT SPATIAL LAYOUT FEATURE EXTRACTION TO SIMPLIFY TEMPLATE CLASSIFICATION Public/Granted day:2021-08-05

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V30/00	字符识别；数字墨迹识别；面向文档的基于图像的模式识别（文档等的扫描、传输或复制 H04N1/00）
G06V30/40	.面向文档的基于图像的模式识别
G06V30/41	..文件内容分析（基于代码标记的印刷字符识别G06V30/224）
G06V30/414	...提取几何结构，例如布局树；块分割，例如图形或文本的边界框