-
公开(公告)号:DE69231050T2
公开(公告)日:2000-09-14
申请号:DE69231050
申请日:1992-07-30
Applicant: XEROX CORP
Inventor: CASS TODD A , HUTTENLOCHER DANIEL P , WAYNER PETER C
Abstract: Characteristics of images, such as skew of lines of text or dominant typeface of characters, are detected by producing distance data for each of a number of starting pixels within an image. Each starting pixel meets a criterion, such as an edge pixel or a pixel inside a connected component. Each starting pixel's distance data indicates the distance in each of a number of directions from the starting pixel to another pixel at which the image meets another criterion. For example, to detect skew of lines of text, the distance data can indicate distance from a starting pixel at an edge across white pixels to another edge. To detect dominant typeface, the distance data can indicate distance from a starting pixel at an edge or inside a connected component across black pixels to another edge. The separating angle between adjacent directions can be small enough to detect features of an appropriate size at an appropriate distance, such as features of character size at the average intercharacter spacing. The distances at each direction from all the starting pixels can be averaged to obtain a combined distance at each direction. A sufficient number of starting pixels can be used so that the combined distance is approximately the same as if every pixel in the image were a starting pixel. The combined distances at all the directions form a profile. Skew can be determined from peaks in a profile. A profile can be compared with each of a set of model profiles to detect typeface.
-
公开(公告)号:DE69231050D1
公开(公告)日:2000-06-21
申请号:DE69231050
申请日:1992-07-30
Applicant: XEROX CORP
Inventor: CASS TODD A , HUTTENLOCHER DANIEL P , WAYNER PETER C
Abstract: Characteristics of images, such as skew of lines of text or dominant typeface of characters, are detected by producing distance data for each of a number of starting pixels within an image. Each starting pixel meets a criterion, such as an edge pixel or a pixel inside a connected component. Each starting pixel's distance data indicates the distance in each of a number of directions from the starting pixel to another pixel at which the image meets another criterion. For example, to detect skew of lines of text, the distance data can indicate distance from a starting pixel at an edge across white pixels to another edge. To detect dominant typeface, the distance data can indicate distance from a starting pixel at an edge or inside a connected component across black pixels to another edge. The separating angle between adjacent directions can be small enough to detect features of an appropriate size at an appropriate distance, such as features of character size at the average intercharacter spacing. The distances at each direction from all the starting pixels can be averaged to obtain a combined distance at each direction. A sufficient number of starting pixels can be used so that the combined distance is approximately the same as if every pixel in the image were a starting pixel. The combined distances at all the directions form a profile. Skew can be determined from peaks in a profile. A profile can be compared with each of a set of model profiles to detect typeface.
-
公开(公告)号:DE69226609T2
公开(公告)日:1999-02-04
申请号:DE69226609
申请日:1992-11-16
Applicant: XEROX CORP
Inventor: CASS TODD A , HUTTENLOCHER DANIEL P , HALVORSEN PER-KRISTIAN , WITHGOTT M MARGARET , KAPLAN RONALD M , RAO RAMANA B , BLOOMBERG DAN S
Abstract: Methods and apparatus of processing an undecoded document image in a digital computer to modify the document image so as to emphasize semantically significant portions without first converting the document image to character codes. The document image is segmented into image units, and morphological image characteristics of the image units are evaluated to identify significant image units for emphasis. In one embodiment, the significant image units are emphasized by modifying at least one shape characteristic of the significant image units using at least one uniform morphological bitmap operation applied to the entire image unit bitmaps corresponding to the significant image units.
-
公开(公告)号:DE69942297D1
公开(公告)日:2010-06-10
申请号:DE69942297
申请日:1999-12-16
Applicant: XEROX CORP
Inventor: ADLER ANNETTE M , FISHIKIN KENNETH P , MARSHALL CATHERINE C , SILVERMAN ALEXANDER E , CASS TODD A
Abstract: A system and method for extracting key information from digitized audio messages, including telephone voice messages. Information, such as a telephone number and the name of the caller, is derived and extracted from a voice message and used to establish links to the information within the message. The telephone number and name of the caller can then be replayed without the need to replay the entire voice message. The telephone number and name of the caller can also be used as indices into an information database.
-
公开(公告)号:DE60312572D1
公开(公告)日:2007-05-03
申请号:DE60312572
申请日:2003-01-27
Applicant: XEROX CORP
Inventor: SAUND ERIC , MORAN THOMAS P , LARNER DANIEL L , MAHONEY JAMES V , CASS TODD A
IPC: G06T11/60 , G06V30/224 , G09G5/00 , H04N1/387
Abstract: In accordance with one aspect of the present invention, disclosed is an image analysis and conversion method and system, where digital ink images are converted to structured object representations of the digital ink images, capable of being edited by a structured text/graphics editor.
-
公开(公告)号:DE69718959T2
公开(公告)日:2003-07-24
申请号:DE69718959
申请日:1997-05-01
Applicant: XEROX CORP
Inventor: CASS TODD A
Abstract: A processor is provided (410-430) with first and second document images. The first image represents an instance of a reference document to which instance a mark has been added. The second image is selected from among a collection of document images and represents the reference document without the mark. The processor automatically extracts (450) from the first document image a set of pixels representing the mark. This is done by performing a reference-based mark extraction technique in which the second document image serves as a reference image and in which substantially the entirety of the first document image is compared with substantially the entirety of the second document image. Also, the processor is provided (440) with information about a set of active elements of the reference document. The reference document has at least one such active element and each active element is associated with at least one action. The processor interprets (460) the extracted set of pixels representing the mark by determining whether the mark indicates any of the active elements of the reference document. If the mark indicates an active element, the processor facilitates (470) the action with which the indicated active element is associated.
-
公开(公告)号:DE69708214T2
公开(公告)日:2002-05-16
申请号:DE69708214
申请日:1997-06-16
Applicant: XEROX CORP
Inventor: JACKSON WARREN B , BIEGELSEN DAVID K , BERLIN ANDREW A , SPRAGUE ROBERT A , CASS TODD A
Abstract: A sensor system (100a) for measuring physical properties of a sheet (116). In one embodiment, the sensor system measures paper curl and thickness using two sensors, each of which includes a member (112), a base (114), and measurement circuit. The two members (112) are positioned in opposition to each other and both contact the sheet (116) as it passes between them. Each member is coupled to a base (114), which includes a measurement circuit. Each measurement circuit measures the displacement of its associated member (112). In another embodiment (Fig. 4), the sensor system measures stiffness and curl using two pairs of opposed sensors. In yet another embodiment, the sensor system measures thermal diffusivity of a sheet using three sensors, one of which includes a heater for heating the sheet of paper and the other two sensors include thermocouples in contact with the sheet of paper for sensing the heat of the paper.
-
公开(公告)号:DE69226609D1
公开(公告)日:1998-09-17
申请号:DE69226609
申请日:1992-11-16
Applicant: XEROX CORP
Inventor: CASS TODD A , HUTTENLOCHER DANIEL P , HALVORSEN PER-KRISTIAN , WITHGOTT M MARGARET , KAPLAN RONALD M , RAO RAMANA B , BLOOMBERG DAN S
Abstract: Methods and apparatus of processing an undecoded document image in a digital computer to modify the document image so as to emphasize semantically significant portions without first converting the document image to character codes. The document image is segmented into image units, and morphological image characteristics of the image units are evaluated to identify significant image units for emphasis. In one embodiment, the significant image units are emphasized by modifying at least one shape characteristic of the significant image units using at least one uniform morphological bitmap operation applied to the entire image unit bitmaps corresponding to the significant image units.
-
公开(公告)号:CA2078423C
公开(公告)日:1997-01-14
申请号:CA2078423
申请日:1992-09-16
Applicant: XEROX CORP
Inventor: WITHGOTT M MARGARET , NEWMAN WILLIAM , BAGLEY STEVEN C , HUTTENLOCHER DANIEL P , KAPLAN RONALD M , CASS TODD A , HALVORSEN PER-KRISTIAN , BROWN JOHN SEELY , KAY MARTIN
Abstract: A method and apparatus for applying morphological image criteria that identify image units in an undecoded document image having significant information content, and for retrieving related data that supplements the document either from elsewhere within the document or a source external to the document. The retrieved data can result from character code, recognition or template matching of the identified significant image units, or the retrieved data can result directly from an analysis of the morphological image characteristics of the identified significant image units. A reading machine can allow a user to browse and select documents or segments thereof, and to obtain interactive retrieval of documents and supplemental data.
-
公开(公告)号:CA2078423A1
公开(公告)日:1993-05-20
申请号:CA2078423
申请日:1992-09-16
Applicant: XEROX CORP
Inventor: WITHGOTT M MARGARET , NEWMAN WILLIAM , BAGLEY STEVEN C , HUTTENLOCHER DANIEL P , KAPLAN RONALD M , CASS TODD A , HALVORSEN PER-KRISTIAN , BROWN JOHN S , KAY MARTIN
Abstract: A method and apparatus for applying morphological image criteria that identify image units in an undecoded document image having significant information content, and for retrieving related data that supplements the document either from elsewhere within the document or a source external to the document. The retrieved data can result from character code recognition or template matching of the identified significant image units, or the retrieved data can result directly from an analysis of the morphological image characteristics of the identified significant image units. A reading machine can allow a user to browse and select documents or segments thereof, and to obtain interactive retrieval of documents and supplemental data.
-
-
-
-
-
-
-
-
-