Method and apparatus for separating text and figures in document images

Invention Grant

US10796145B2 Method and apparatus for separating text and figures in document images (in force)

Patent Title: Method and apparatus for separating text and figures in document images
Application No.: US16022016

Application Date: 2018-06-28
Publication No.: US10796145B2

Publication Date: 2020-10-06
Inventor: Valery Valerievich Anisimovskiy
Applicant: Samsung Electronics Co., Ltd.
Applicant Address: KR Suwon-si
Assignee: Samsung Electronics Co., Ltd.
Current Assignee: Samsung Electronics Co., Ltd.
Current Assignee Address: KR Suwon-si
Agency: Jefferson IP Law, LLP
Main IPC: G06K9/00
IPC: G06K9/00 ; G06K9/62 ; G06N3/08

Abstract:

A method and apparatus for separating text and figures in a document image are provided. The method includes acquiring a document image, dividing the document image into a plurality of regions of interest, acquiring a feature vector by using a two-dimensional (2D) histogram by resizing the regions of interest and extracting connected components of the regions of interest, acquiring a transformation vector of the feature vector by using a kernel, obtaining a cluster center of the transformation vector, performing clustering on the cluster center to acquire a supercluster, and classifying the supercluster into one of a text class and a figure class, based on the number of superclusters.
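The feature-extraction step named in the abstract (extracting connected components from a region of interest and summarizing them in a 2D histogram) can be illustrated with a minimal sketch. This is not the patented implementation. The abstract does not say which two quantities the histogram is built over, so the use of component height and width as the two axes is an assumption made here for illustration. The function names `connected_components` and `histogram_feature`, and the parameters `bins` and `max_size`, are likewise hypothetical. The resizing, kernel transformation, and clustering steps are not shown.

```python
from collections import deque


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of 4-connected
    foreground blobs in a binary image given as a list of rows."""
    rows, cols = len(img), len(img[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if img[r][c] == 0 or seen[r][c]:
                continue
            # Breadth-first flood fill from an unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and img[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def histogram_feature(boxes, bins=4, max_size=8):
    """Flattened, normalized 2D histogram over (height, width) of the boxes.
    ASSUMPTION: height/width axes are illustrative, not taken from the patent."""
    hist = [[0] * bins for _ in range(bins)]
    for top, left, bottom, right in boxes:
        h = bottom - top + 1
        w = right - left + 1
        # Map sizes 1..max_size onto bin indices 0..bins-1, clamping larger sizes.
        i = min((h - 1) * bins // max_size, bins - 1)
        j = min((w - 1) * bins // max_size, bins - 1)
        hist[i][j] += 1
    total = len(boxes) or 1
    return [count / total for row in hist for count in row]


# Toy region of interest: 1 = foreground (ink), 0 = background.
roi = [
    [1, 1, 0, 0, 0, 0],
    [1, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0],
]
feature = histogram_feature(connected_components(roi))
```

In this sketch each region of interest yields one fixed-length vector (here 16 values), which is the kind of input the later kernel-transformation and clustering stages described in the abstract would consume.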

Public/Granted literature

US20190005324A1 METHOD AND APPARATUS FOR SEPARATING TEXT AND FIGURES IN DOCUMENT IMAGES Publication/Grant Date: 2019-01-03

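IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)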