Abstract:
A table and flowchart detection method is disclosed. First, based on connected component analysis and the sizes of the connected components, a target connected component that corresponds to possible elements of table or flowchart is detected in the input image. The target connected component is broken into corners and edges that connect the corners. Based on the relationship between the corners and edges, it is determined whether the target connected component is a table or a flowchart. For table detection, the edges and corners are linked into horizontal sets and vertical sets, and based on corner counts in the horizontal sets and vertical sets, it is determined whether the target connected component is a table. For flowchart detection, the boundary boxes and connecting lines between boundary boxes are detected to determine whether the target connected component is a flowchart.
Abstract:
A method for image processing, including: obtaining an image including a writing board and a background external to the writing board; detecting a plurality of lines within the image; determining, based on the plurality of lines, a plurality of corners of the writing board within the image; and correcting a perspective of the writing board by applying a transformation to the image based on the plurality of corners.
Abstract:
A 2D color barcode layout is disclosed. The barcode includes a 2D array of data cells, four corner locators, and border reference cells forming four borders between the corner locators that substantially surround the array of data cells. Each data cell and border reference cell has one of four primary colors (e.g. CMYK). Most border reference cells have the same size as the data cells, except for yellow ones which are longer. The border reference cells form a repeating color sequence along the borders, and are used during decoding to calculate (1) the channel offset (a spatial offset) of each primary color at different locations along the borders and (2) the reference (average) color values of each primary color. During decoding, the color values of each data cell is measured while taking into account channel offset which is calculated by interpolating the channel offset of the border reference cells.
Abstract:
A vertical and horizontal line detection method for document images includes generating multiple binary images from the input grayscale document image based on multiple binarization thresholds, detecting horizontal and vertical lines in each of the multiple binary images independently, and merging the detection results from the multiple binary images. The line detection process for each binary image include applying an opening operation using a vertical or horizontal line as the structuring element, and removing connected components that are not vertical or horizontal lines based on a stroke width analysis. The boundaries of the detected lines are obtained using horizontal and vertical projections.
Abstract:
A vertical and horizontal line detection method for document images includes generating multiple binary images from the input grayscale document image based on multiple binarization thresholds, detecting horizontal and vertical lines in each of the multiple binary images independently, and merging the detection results from the multiple binary images. The line detection process for each binary image include applying an opening operation using a vertical or horizontal line as the structuring element, and removing connected components that are not vertical or horizontal lines based on a stroke width analysis. The boundaries of the detected lines are obtained using horizontal and vertical projections.
Abstract:
A processing method for a color barcode image captured by a camera or barcode reader. The barcode includes color data cells and multiple large black locators which are located at the four corners of the barcode, along the four borders, and inside the barcode forming an array. The locators are first identified from a grayscale version of the barcode image using multiple binarization technique to ensure that all locators are identified. Perspective correction and illumination correction may be applied to the barcode image using the identified locators.
Abstract:
A processing method for a color barcode image captured by a camera or barcode reader. The barcode includes color data cells and multiple large black locators which are located at the four corners of the barcode, along the four borders, and inside the barcode forming an array. The locators are first identified. Perspective correction is performed by dividing the barcode image into smaller regions each containing four locators, individually transforming each region into its original shape e.g. square, and spatially combining them into the barcode image. Illumination correction is applied to the barcode image based on average pixel intensities of the locators, by calculating an illumination correction map using the average densities of the locators and 2-dimensional interpolation for pixel positions other than the locators.
Abstract:
A method for binarization of document image using multi-threshold process to determine an optimum global binarization threshold for the image. The optimum binarization threshold is determined by binarizing the document multiple times using different threshold values, and calculating the statistics of the useful information and noise for each threshold value to select the optimum threshold value.
Abstract:
A 2D color barcode layout is disclosed. The barcode includes a 2D array of data cells, four corner locators, and border reference cells forming four borders between the corner locators that substantially surround the array of data cells. Each data cell and border reference cell has one of four primary colors (e.g. CMYK). Most border reference cells have the same size as the data cells, except for yellow ones which are longer. The border reference cells form a repeating color sequence along the borders, and are used during decoding to calculate (1) the channel offset (a spatial offset) of each primary color at different locations along the borders and (2) the reference (average) color values of each primary color. During decoding, the color values of each data cell is measured while taking into account channel offset which is calculated by interpolating the channel offset of the border reference cells.
Abstract:
An image encoded with character information can be created by binarizing an input image followed by connected component labeling, and then repeating the binarization and connected component labeling on an inverted version of the input image. This results in identification of connected components. Related connected components are arranged in a family tree in which successive generations of the connected components alternate between two tree layer classifications. One of the tree layer classifications is selected based on whether certain connected components define characters. A label image is created which includes labels for the connected components except for the connected components in the selected tree layer classification.