-
公开(公告)号:DE2460757A1
公开(公告)日:1975-10-23
申请号:DE2460757
申请日:1974-12-21
Applicant: IBM
-
公开(公告)号:DE2541204A1
公开(公告)日:1976-04-15
申请号:DE2541204
申请日:1975-09-16
Applicant: IBM
Inventor: WILLIS BOLLINGER GEB WILLIS EL , CHAIRES GEB LYONS ANNE MARIE , CICONTE GEB SCHELTES JEAN MARI , ETT ALLEN HAROLD , HILLIARD JOHN JOSEPH , KOCHER DONALD FRANCIS , ROSENBAUM WALTER STEVEN
Abstract: A cluster storage apparatus is disclosed for outputting groups of valid alpha words as potential candidates for the correct form of an alpha word misrecognized by a character recognition machine. Groups of alpha words are arranged in the cluster storage apparatus such that adjacent locations contain alpha words having similar character recognition misread propensities. Alpha words which have been determined to be misrecognized, are input to the cluster storage apparatus. Numerical values assigned to the characters of which the input word is composed, are used to calculate the address of that group of valid alpha words having similar character recognition misread propensities. The cluster storage apparatus then outputs the accessed groups of alpha words for subsequent processing. The organization of the cluster storage apparatus minimizes the difference in address between alpha words with similar character recognition misread propensities by assigning high numeric values to highly reliable characters, as determined by measuring the character transfer function of the character recognition machine.
-
公开(公告)号:DE2435889A1
公开(公告)日:1975-10-16
申请号:DE2435889
申请日:1974-07-25
Applicant: IBM
Inventor: CHAIRES GEB LYONS ANNE MARIE , CICONTE GEB SCHELTES JEAN MARI , HILLIARD JOHN JOSEPH , ROSENBAUM WALTER STEVEN , ETT ALLEN HAROLD
Abstract: An online numeric discriminator is disclosed which performs the decision making process between strings of characters coming from a dual output optical character recognition system for use in text processing or mail processing applications. The dual output OCR uses separate recognition processes for alphabetic and numeric characters and attempts to recognize each character independently as both an alphabetic and a numeric character. The alphabetic interpretation of the scanned word is outputted as an alphabetic subfield on a first output line and the numeric interpretation of the scanned word is outputted as a numeric subfield on a second output line from the OCR. The bayesian online numeric discriminator then analyzes the two character streams by calculating a first conditional probability that the OCR perceived the alphabetic subfield given that a numeric subfield was actually scanned and a second conditional probability that the OCR perceived the numeric subfield given that an alphabetic subfield was actually scanned. These first and second conditional probabilities are then compared. If the conditional probability that the OCR read the alphabetic subfield given that the numeric subfield was actually scanned, is larger than the conditional probability that the OCR read the numeric subfield given that the alphabetic subfield was actually scanned, then the numeric subfield is selected by the discriminator as the most probable interpretation of the word scanned by the OCR.
-
-