Document image binarization method based on content type separation

    Publication No.: US09965695B1

    Publication date: 2018-05-08

    Application No.: US15394841

    Filing date: 2016-12-30

    Inventor: Wei Ming

    Abstract: A method for binarizing a grayscale document image, which first divides the document image into a plurality of sub-images and determines a type of each sub-image based on a horizontal projection profile and a density of each sub-image, the type being 1: text only, 2: graphics only, 3: photo only, 4: text and graphics, 5: text and photo, 6: graphics and photo, or 7: text and graphics and photo. Then a selected one of first to seventh binarization processes is applied to binarize each sub-image based on its type to generate a binary sub-image. All binary sub-images are then combined to generate a binary image of the grayscale document image. Of the first to seventh binarization processes respectively applied to the first to seventh types of sub-images, at least those for the first, second, third, fifth, sixth and seventh types are different from each other.
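The divide, classify, binarize and recombine flow described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the abstract does not state the classification criteria, so `classify_tile` uses a toy rule (an assumption) that only ever returns type 1 or type 3, and `global_threshold` and `mean_threshold` are stand-in binarizers. All function names and the threshold values 128 and 0.3 are hypothetical. Pixels are 0 (black) to 255 (white) on input, and 0 (ink) or 1 (background) on output.

```python
from typing import Callable, Dict, Iterator, List, Tuple

Image = List[List[int]]  # rows of grayscale pixels, 0 = black ... 255 = white


def split_into_tiles(img: Image, tile_h: int, tile_w: int) -> Iterator[Tuple[int, int, Image]]:
    """Yield (top, left, tile) for non-overlapping tiles that cover the image."""
    height = len(img)
    width = len(img[0]) if img else 0
    for top in range(0, height, tile_h):
        for left in range(0, width, tile_w):
            yield top, left, [row[left:left + tile_w] for row in img[top:top + tile_h]]


def horizontal_projection(tile: Image, dark: int = 128) -> List[int]:
    """Number of dark pixels in each row of the tile."""
    return [sum(1 for p in row if p < dark) for row in tile]


def density(tile: Image, dark: int = 128) -> float:
    """Fraction of the tile's pixels that are dark."""
    total = sum(len(row) for row in tile)
    return sum(horizontal_projection(tile, dark)) / total if total else 0.0


def classify_tile(tile: Image) -> int:
    """Toy rule (assumption): blank rows plus low density suggests text (1), else photo (3)."""
    profile = horizontal_projection(tile)
    if any(count == 0 for count in profile) and density(tile) < 0.3:
        return 1
    return 3


def global_threshold(tile: Image, t: float = 128) -> Image:
    """Stand-in binarizer: fixed threshold, 0 = ink, 1 = background."""
    return [[0 if p < t else 1 for p in row] for row in tile]


def mean_threshold(tile: Image) -> Image:
    """Stand-in binarizer: threshold at the tile's mean gray level."""
    pixels = [p for row in tile for p in row]
    return global_threshold(tile, sum(pixels) / len(pixels) if pixels else 0)


def binarize_document(img: Image, tile_h: int, tile_w: int,
                      binarizers: Dict[int, Callable[[Image], Image]]) -> Image:
    """Binarize each tile with the process registered for its type, then recombine."""
    out = [[1] * len(row) for row in img]
    for top, left, tile in split_into_tiles(img, tile_h, tile_w):
        binary = binarizers[classify_tile(tile)](tile)
        for r, row in enumerate(binary):
            out[top + r][left:left + len(row)] = row
    return out
```

A caller would pass a dictionary mapping each of the seven type numbers to a binarization function. The dispatch table keeps the per-type processes independent, which mirrors the abstract's requirement that several of them differ from one another.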

    METHOD FOR SEGMENTING TEXT WORDS IN DOCUMENT IMAGES USING VERTICAL PROJECTIONS OF CENTER ZONES OF CHARACTERS
    12.
    Patent application (status: in force)

    Publication No.: US20160180163A1

    Publication date: 2016-06-23

    Application No.: US14578066

    Filing date: 2014-12-19

    Inventor: Wei Ming

    Abstract: A word segmentation method for segmenting a text line into word segments, which is particularly advantageous for processing italic text but can also be used for regular text. A horizontal center zone of the text line, corresponding to the vertical center parts of the characters, is used to generate a center-zone-only vertical projection profile. The center zone is determined using a horizontal projection profile, by locating the two major peaks of that profile and defining the two major peak positions as the upper and lower boundaries of the center zone. Spacing segments (white gaps) in the vertical projection profile are identified and classified into two classes, namely character spacing (gaps between characters within a word) and word spacing (gaps between words). The word spacings are used to segment the text line into word segments.
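The steps in this abstract can be sketched as follows, for a binarized text line given as rows of 0 (background) and 1 (ink). This is a simplified illustration under stated assumptions: the peak picker takes the two tallest local maxima of the horizontal projection profile, and gaps are split into the two classes by a midpoint rule on gap width. The abstract specifies neither detail, so both rules and all function names are hypothetical.

```python
from typing import List, Tuple

Line = List[List[int]]  # rows of a binary text line, 1 = ink, 0 = background


def center_zone(line: Line) -> Tuple[int, int]:
    """Rows of the two tallest local peaks of the horizontal projection profile."""
    profile = [sum(row) for row in line]
    peaks = []
    for i, value in enumerate(profile):
        prev = profile[i - 1] if i > 0 else -1
        nxt = profile[i + 1] if i + 1 < len(profile) else -1
        if value > prev and value >= nxt:
            peaks.append(i)
    top_two = sorted(sorted(peaks, key=lambda i: profile[i], reverse=True)[:2])
    if len(top_two) < 2:
        return 0, len(line) - 1  # fall back to the full line height
    return top_two[0], top_two[1]


def find_gaps(profile: List[int]) -> List[Tuple[int, int]]:
    """Runs of empty columns between the first and last inked column, as (start, end)."""
    inked = [i for i, v in enumerate(profile) if v > 0]
    gaps, start = [], None
    if not inked:
        return gaps
    for col in range(inked[0], inked[-1] + 1):
        if profile[col] == 0:
            if start is None:
                start = col
        elif start is not None:
            gaps.append((start, col))  # end is exclusive
            start = None
    return gaps


def word_spacings(gaps: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Assumed rule: gaps wider than the midpoint of min and max width separate words."""
    widths = [end - start for start, end in gaps]
    if not widths or min(widths) == max(widths):
        return []  # all gaps look alike: treat them as character spacing
    cut = (min(widths) + max(widths)) / 2
    return [gap for gap in gaps if gap[1] - gap[0] > cut]


def segment_words(line: Line) -> List[Tuple[int, int]]:
    """Return (start, end) column ranges of words, end exclusive."""
    upper, lower = center_zone(line)
    zone = line[upper:lower + 1]
    profile = [sum(col) for col in zip(*zone)]  # center-zone-only vertical projection
    inked = [i for i, v in enumerate(profile) if v > 0]
    if not inked:
        return []
    words, start = [], inked[0]
    for gap_start, gap_end in word_spacings(find_gaps(profile)):
        words.append((start, gap_start))
        start = gap_end
    words.append((start, inked[-1] + 1))
    return words
```

Restricting the vertical projection to the center zone ignores the parts of the characters above and below it, which is the property the abstract relies on for italic text.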


    DOCUMENT IMAGE COMPRESSION METHOD AND ITS APPLICATION IN DOCUMENT AUTHENTICATION
    13.
    Patent application (status: under examination, published)

    Publication No.: US20160078632A1

    Publication date: 2016-03-17

    Application No.: US14951701

    Filing date: 2015-11-25

    Inventors: Yibin Tian, Wei Ming

    Abstract: A method for compressing a bi-level document image containing text is disclosed. The document image is segmented into symbol images, each representing a letter, numeral, etc. in the document. The symbol images are classified into a plurality of classes, each class being associated with a template image and a class index. Classification is done by comparing each symbol to be classified with the templates of existing classes, using a number of image features including zoning profiles, side profiles, topology statistics, and low-order image moments. These image features are compared using a tolerance-based method to determine whether the symbol matches the template. After classification, certain classes that have few symbols classified into them may be merged with other classes. In addition, the template images of the classes are down-sampled, where the final sizes of the template images depend on the likelihood of confusion of the template with other templates.
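The classification step in this abstract can be sketched as follows: each symbol is compared with the templates of existing classes, and a new class is opened when nothing matches within tolerance. This is only an illustration. The feature set here (ink area, centroid, and a left side profile) is a small stand-in for the features the abstract lists, the single shared tolerance is an assumption, and the class merging and template down-sampling steps are left out. All names are hypothetical.

```python
from typing import List, Tuple

Symbol = List[List[int]]  # binary symbol image, 1 = ink, 0 = background


def features(symbol: Symbol) -> List[float]:
    """Ink area, centroid (low-order moments), and the left side profile per row."""
    width = len(symbol[0]) if symbol else 0
    area = sum(sum(row) for row in symbol)
    if area:
        cx = sum(c for row in symbol for c, p in enumerate(row) if p) / area
        cy = sum(r for r, row in enumerate(symbol) for p in row if p) / area
    else:
        cx = cy = 0.0
    # Column of the first ink pixel in each row; `width` marks an empty row.
    left_profile = [next((c for c, p in enumerate(row) if p), width) for row in symbol]
    return [float(area), cx, cy] + [float(v) for v in left_profile]


def matches(a: List[float], b: List[float], tol: float) -> bool:
    """Tolerance-based comparison: every feature must agree to within `tol`."""
    return len(a) == len(b) and all(abs(x - y) <= tol for x, y in zip(a, b))


def classify(symbols: List[Symbol], tol: float = 1.0) -> Tuple[List[List[float]], List[int]]:
    """Assign each symbol a class index; the first symbol of a class is its template."""
    templates: List[List[float]] = []
    indices: List[int] = []
    for symbol in symbols:
        feats = features(symbol)
        for index, template in enumerate(templates):
            if matches(feats, template, tol):
                indices.append(index)
                break
        else:  # no existing template matched: start a new class
            templates.append(feats)
            indices.append(len(templates) - 1)
    return templates, indices
```

The compressed representation would then store one template per class plus the sequence of class indices, rather than every symbol image.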


    Print management in print-on-demand jobs
    14.
    Granted patent (status: in force)

    Publication No.: US09232108B2

    Publication date: 2016-01-05

    Application No.: US14041026

    Filing date: 2013-09-30

    Inventor: Wei Ming

    Abstract: A method for managing reproduction of a print generation of a document, where a machine-readable pattern of the original print has been previously generated and printed on the original print and contains document registration and management information of the original print. The method includes the steps of receiving a print-on-demand (POD) job order for producing a reprint of the original print, retrieving document registration information and print management information from the machine-readable pattern, authenticating the original print based on the document registration information, verifying reprint permission based on the print management information, generating a new machine-readable pattern for the reprint, maintaining a master machine-readable pattern on a digital form of the document or a data file for the document with updated information of the reprint, and completing the POD job order by producing the reprint with the new machine-readable pattern.


    Method and system for enhancing interactions between teachers and students

    Publication No.: US10013889B2

    Publication date: 2018-07-03

    Application No.: US14230949

    Filing date: 2014-03-31

    Inventors: Yibin Tian, Wei Ming

    CPC classification numbers: G09B5/00, G09B7/02

    Abstract: A method, a computer program product, and a system for enhancing an interaction between a teacher and a student are disclosed. The method includes receiving video images of a region of interest from a plurality of multi-functional devices; comparing the video images of the region of interest received from the plurality of multi-functional devices; detecting differences in the region of interest of at least one multi-functional device in comparison to the region of interest of the plurality of multi-functional devices; and providing a signal to the at least one multi-functional device based on the detected difference in the region of interest.

    Method and apparatus for authenticating printed documents that contains both dark and halftone text
    17.
    Granted patent (status: in force)

    Publication No.: US09596378B2

    Publication date: 2017-03-14

    Application No.: US15071623

    Filing date: 2016-03-16

    Abstract: A document authentication method determines the authenticity of a target hardcopy document, which purports to be a true copy of an original hardcopy document. The method compares a binarized image of the target document with a binarized image of the original document which has been stored in a storage device. The image of the original document is generated by binarizing a scanned grayscale image of the original document. Halftone and non-halftone text areas in the grayscale image are separated, and the two types of text are binarized separately. The non-halftone text areas are then down-sampled. During authentication, a scanned grayscale image of the target document is binarized by separating halftone and non-halftone text areas and binarizing them separately, and then down-sampling the non-halftone text areas. The binarized images of the target document and the original document are compared to determine the authenticity of the target document.
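The final two stages described in this abstract, down-sampling a binarized area and comparing the stored and target images, might look roughly like the sketch below. It is a loose illustration only: the block-OR down-sampling rule, the pixel-difference metric, and the 1% acceptance threshold are all assumptions, and the separation of halftone from non-halftone text is not shown. All names are hypothetical. Images are rows of 0 (background) and 1 (ink).

```python
from typing import List

Binary = List[List[int]]  # binary image, 1 = ink, 0 = background


def downsample(img: Binary, factor: int = 2) -> Binary:
    """Assumed rule: an output pixel is ink if any pixel in its factor x factor block is ink."""
    height = len(img) // factor
    width = len(img[0]) // factor if img else 0
    return [
        [
            1 if any(img[r * factor + dr][c * factor + dc]
                     for dr in range(factor) for dc in range(factor)) else 0
            for c in range(width)
        ]
        for r in range(height)
    ]


def difference_ratio(a: Binary, b: Binary) -> float:
    """Fraction of pixel positions at which two same-sized binary images differ."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")
    total = sum(len(row) for row in a)
    differing = sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return differing / total if total else 0.0


def is_authentic(original: Binary, target: Binary, max_ratio: float = 0.01) -> bool:
    """Accept the target when it differs from the stored original by at most max_ratio."""
    return difference_ratio(original, target) <= max_ratio
```

A real comparison would also need to register the two scans against each other before differencing; that step is outside what the abstract describes and is omitted here.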


    AUTOMATIC SELECTION OF OPTIMUM ALGORITHMS FOR HIGH DYNAMIC RANGE IMAGE PROCESSING BASED ON SCENE CLASSIFICATION
    18.
    Patent application (status: in force)

    Publication No.: US20150170389A1

    Publication date: 2015-06-18

    Application No.: US14105652

    Filing date: 2013-12-13

    Abstract: A method for processing high dynamic range (HDR) images by selecting preferred tone mapping operators and gamut mapping algorithms based on scene classification. Scenes are classified into indoor scenes, outdoor scenes, and scenes with people, and tone mapping operators and gamut mapping algorithms are selected on that basis. Prior to scene classification, the multiple images taken at various exposure values are fused into a low dynamic range (LDR) image using an exposure fusing algorithm, and scene classification is performed using the fused LDR image. Then, the HDR image generated from the multiple images is tone mapped into an LDR image using the selected tone mapping operator and then gamut mapped to the color space of the output device, such as a printer.

