Invention Grant
- Patent Title: Method and apparatus for separating text and figures in document images
- Application No.: US16022016
- Application Date: 2018-06-28
- Publication No.: US10796145B2
- Publication Date: 2020-10-06
- Inventor: Valery Valerievich Anisimovskiy
- Applicant: Samsung Electronics Co., Ltd.
- Applicant Address: KR Suwon-si
- Assignee: Samsung Electronics Co., Ltd.
- Current Assignee: Samsung Electronics Co., Ltd.
- Current Assignee Address: KR Suwon-si
- Agency: Jefferson IP Law, LLP
- Main IPC: G06K9/00
- IPC: G06K9/00 ; G06K9/62 ; G06N3/08

Abstract:
A method and apparatus for separating text and figures in a document image are provided. The method includes acquiring a document image; dividing the document image into a plurality of regions of interest; acquiring a feature vector by using a two-dimensional (2D) histogram obtained by resizing the regions of interest and extracting connected components of the regions of interest; acquiring a transformation vector of the feature vector by using a kernel; obtaining cluster centers of the transformation vectors; performing clustering on the cluster centers to acquire superclusters; and classifying each supercluster into one of a text class and a figure class, based on the number of superclusters.
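The pipeline in the abstract can be illustrated with a minimal sketch. This is not the patented implementation, and several choices are assumptions made only for illustration: connected-component centroids are taken as already extracted for each region of interest, the 2D histogram is built over nearest-neighbor distance and angle, the kernel step uses the explicit Hellinger feature map (element-wise square root), and plain k-means stands in for both clustering stages. The function names `roi_feature`, `kmeans`, and `supercluster_rois` are hypothetical. The final text/figure labeling is left out because the abstract does not state the rule.

```python
import numpy as np

def roi_feature(centroids, bins=8):
    """Flattened 2D histogram of (distance, angle) to each centroid's nearest neighbor.

    Assumption: the descriptor layout is illustrative, not taken from the patent.
    """
    pts = np.asarray(centroids, dtype=float)
    diff = pts[:, None, :] - pts[None, :, :]
    dist = np.hypot(diff[..., 0], diff[..., 1])
    np.fill_diagonal(dist, np.inf)              # ignore self-distances
    idx = np.arange(len(pts))
    nn = dist.argmin(axis=1)                    # nearest neighbor of each point
    d = dist[idx, nn]
    d = d / d.max()                             # scale distances into [0, 1]
    ang = np.arctan2(diff[idx, nn, 1], diff[idx, nn, 0])
    hist, _, _ = np.histogram2d(
        d, ang, bins=bins, range=[[0.0, 1.0], [-np.pi, np.pi]]
    )
    return (hist / hist.sum()).ravel()          # L1-normalized feature vector

def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centers, labels)."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)]

    def assign(c):
        return ((x[:, None, :] - c[None, :, :]) ** 2).sum(-1).argmin(1)

    for _ in range(iters):
        labels = assign(centers)
        for j in range(k):
            if np.any(labels == j):             # keep old center if cluster is empty
                centers[j] = x[labels == j].mean(0)
    return centers, assign(centers)

def supercluster_rois(roi_centroids, n_clusters=4, n_super=2):
    """Map each region of interest to a supercluster index."""
    feats = np.stack([roi_feature(c) for c in roi_centroids])
    mapped = np.sqrt(feats)                     # assumed kernel: Hellinger feature map
    centers, labels = kmeans(mapped, n_clusters)
    _, super_of_center = kmeans(centers, n_super)  # cluster the cluster centers
    return super_of_center[labels]
```

Calling `supercluster_rois` with a list of per-region centroid arrays returns one supercluster index per region; deciding which supercluster is text and which is figure would be a separate step.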
Public/Granted literature
- US20190005324A1 METHOD AND APPARATUS FOR SEPARATING TEXT AND FIGURES IN DOCUMENT IMAGES Public/Granted day: 2019-01-03