Abstract:
Methods and systems are provided for check code line recognition at a point-of-sale terminal having OCR and MICR capabilities. Images of problematic characters are extracted and automatically transmitted to a remote location for on-line manual validation or data entry. Video coding of problematic characters is performed for the correction of either or both MICR and OCR results. The correctly encoded characters are returned to the point of sale within a few seconds, and combined as necessary with locally recognized characters, to assemble a correct code line for entry into a payment system.
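The final assembly step described above can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how a character is judged problematic, so the confidence threshold, the `CharResult` type, and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class CharResult:
    """One locally recognized code-line position (hypothetical structure)."""
    value: Optional[str]   # recognized character, or None if rejected
    confidence: float      # recognizer confidence in [0, 1] (assumed)


def problematic_positions(results: List[CharResult],
                          threshold: float = 0.9) -> List[int]:
    """Indices whose images would be sent out for remote video coding.

    Assumption: "problematic" means rejected or below a confidence threshold.
    """
    return [i for i, r in enumerate(results)
            if r.value is None or r.confidence < threshold]


def assemble_code_line(results: List[CharResult],
                       corrections: Dict[int, str],
                       threshold: float = 0.9) -> str:
    """Combine remotely keyed characters with locally recognized ones."""
    chars = []
    for i, r in enumerate(results):
        if i in corrections:
            # A remote operator's entry takes precedence over the local result.
            chars.append(corrections[i])
        elif r.value is not None and r.confidence >= threshold:
            chars.append(r.value)
        else:
            raise ValueError(f"position {i} is still unresolved")
    return "".join(chars)
```

In this sketch the terminal would call `problematic_positions`, transmit the corresponding character images, and call `assemble_code_line` once the keyed values come back.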
Abstract:
PROBLEM TO BE SOLVED: To estimate a relative threshold corresponding to the intensity difference between text and background in an OCR system. SOLUTION: A pixel 10 is classified as a text pixel according to whether the differences between its value and the values of a plurality of pixels separated from it by a prescribed distance are larger than a relative threshold corresponding to the intensity difference between text and background. The image is subsampled at a rate corresponding to two pixels to detect text kernels, and image pixels are binarized using the estimated threshold only within tiles that contain a text kernel and whose sides measure several stroke widths. In determining a text pixel, the differences examined are those between the value of the pixel under analysis and the values of the pixels located where a circle 12, centered on that pixel with a radius equal to the stroke width W, intersects the row line, the column line, and the two lines at 45 deg. angles; each such difference is compared with the relative threshold.
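The text-pixel test can be sketched as below. The abstract does not state how many of the eight differences must exceed the threshold, nor the text polarity, so the `min_hits` parameter and the dark-text-on-light-background convention are assumptions, and the function names are hypothetical.

```python
import math

import numpy as np


def circle_offsets(w: int):
    """(row, col) offsets where a circle of radius w meets the row line,
    the column line, and the two 45-degree diagonals (eight points)."""
    d = int(round(w / math.sqrt(2)))  # diagonal step approximating radius w
    return [(0, w), (0, -w), (w, 0), (-w, 0),
            (d, d), (d, -d), (-d, d), (-d, -d)]


def is_text_pixel(img: np.ndarray, r: int, c: int, w: int, t: int,
                  min_hits: int = 2) -> bool:
    """Return True if enough circle pixels are brighter than img[r, c] by
    more than the relative threshold t.

    Assumptions: dark text on a light background; min_hits is illustrative.
    """
    rows, cols = img.shape
    hits = 0
    for dr, dc in circle_offsets(w):
        rr, cc = r + dr, c + dc
        if 0 <= rr < rows and 0 <= cc < cols:
            # Cast to int so uint8 subtraction cannot wrap around.
            if int(img[rr, cc]) - int(img[r, c]) > t:
                hits += 1
    return hits >= min_hits
```

For a thin vertical stroke, the neighbours along the stroke are themselves dark, so only the six off-stroke points exceed the threshold; that is why this sketch does not require all eight differences to pass.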
Abstract:
Selects a number of anchor points in an image, each assigned a gray-scale value. Determines a horizontal and a vertical deviation for each point, dependent on the difference between the gray-scale value of the anchor point and that of the neighboring horizontal or vertical anchor point. Defines an anchor point as dominant if it satisfies a predefined condition. In the first processing stage, a general search for regions of interest (ROI) is carried out. Dominance is determined (52) by selecting a first number of anchor points, each having a corresponding intensity. A horizontal and a vertical deviation are calculated for each anchor point to determine the vertically and horizontally dominant points, which are used to determine (54) probable text-containing positions in the image. The second processing stage carries out a second iteration over the text regions of interest found in the first stage, to improve recognition of text ROIs, delete falsely labelled ROIs, and assign an order of precedence to the text ROIs.
Abstract:
A data entry system generates an electronically stored coded representation of a character sequence from one or more electronically stored document images, comprising optical character recognition logic (90) for generating, from the document image or images, character data specifying one of a plurality of possible character values for corresponding segments of the document images; characterised by interactive display apparatus comprising: means (110) for generating and sequentially displaying, one or more types of composite image, each composite image comprising segments of the document image or images arranged according to the character data, and a correction mechanism responsive to a user input operation to enable the operator to correct the character data associated with displayed segments.
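One way to read "arranged according to the character data" is that segments recognized as the same character are displayed together, so that a misrecognized segment stands out among its neighbours. The sketch below follows that reading, which is an assumption; the `Segment` type and both function names are hypothetical, and the image data itself is omitted.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Segment:
    """A document-image segment with the OCR's chosen value (hypothetical)."""
    seg_id: int
    value: str


def group_by_value(segments: List[Segment]) -> Dict[str, List[int]]:
    """Layout for composite images: segment ids grouped by recognized value."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for seg in segments:
        groups[seg.value].append(seg.seg_id)
    return dict(groups)


def correct(segments: List[Segment], seg_id: int, new_value: str) -> None:
    """Apply an operator correction to the character data of one segment."""
    for seg in segments:
        if seg.seg_id == seg_id:
            seg.value = new_value
            return
    raise KeyError(seg_id)
```

After a correction, calling `group_by_value` again moves the segment into the group for its corrected value.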
Abstract:
A method for locating a structured field in a gray-scale image of an object, including choosing a plurality of anchor points in the image, each anchor point having a gray-scale value associated therewith. For each anchor point there is determined a horizontal variation dependent on a difference between the gray-scale value of the anchor point and the gray-scale value of a horizontally neighboring anchor point, and there is also determined a vertical variation dependent on a difference between the gray-scale value of the anchor point and the gray-scale value of a vertically neighboring anchor point. Those anchor points whose vertical and horizontal variations obey a first or a second predefined condition are defined as vertically or horizontally dominant respectively. One or more kernels are defined in the image, each such kernel comprising a group of anchor points in predetermined mutual proximity and satisfying a third predefined condition relating the number of vertically-dominant and horizontally-dominant anchor points in the group. The structured field in the image is located using one or more kernels.
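The anchor-point and kernel steps can be sketched as follows. The abstract leaves all three predefined conditions unspecified, so the ones used here (a fixed threshold `t`, a larger-variation tie-break, and a minimum count of each dominance type per window) are assumptions chosen only for illustration, as are the grid spacing and the function names.

```python
import numpy as np


def dominance_maps(gray: np.ndarray, step: int = 4, t: int = 30):
    """Return boolean maps of horizontally and vertically dominant anchors.

    Anchors are sampled on a regular grid (assumed). The variation is the
    absolute gray-scale difference to the next anchor to the right or below.
    """
    anchors = gray[::step, ::step].astype(int)
    h_var = np.abs(np.diff(anchors, axis=1))[:-1, :]  # to right neighbour
    v_var = np.abs(np.diff(anchors, axis=0))[:, :-1]  # to lower neighbour
    # Assumed conditions: variation above t and at least as large as the other.
    h_dom = (h_var > t) & (h_var >= v_var)
    v_dom = (v_var > t) & (v_var > h_var)
    return h_dom, v_dom


def find_kernels(h_dom: np.ndarray, v_dom: np.ndarray,
                 win: int = 3, min_each: int = 2):
    """Top-left anchor indices of win x win groups that contain at least
    min_each anchors of each dominance type (assumed third condition)."""
    kernels = []
    rows, cols = h_dom.shape
    for r in range(rows - win + 1):
        for c in range(cols - win + 1):
            n_h = int(h_dom[r:r + win, c:c + win].sum())
            n_v = int(v_dom[r:r + win, c:c + win].sum())
            if n_h >= min_each and n_v >= min_each:
                kernels.append((r, c))
    return kernels
```

Requiring both dominance types in one window reflects the intuition that text strokes run in both directions, whereas plain rules or edges produce only one type; a real implementation would tune or replace these conditions.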