Invention Grant
US08176054B2 Retrieving electronic documents by converting them to synthetic text
有权
通过将电子文档转换为合成文本来检索电子文档
- Patent Title: Retrieving electronic documents by converting them to synthetic text
- Patent Title (中): 通过将电子文档转换为合成文本来检索电子文档
-
Application No.: US11777142Application Date: 2007-07-12
-
Publication No.: US08176054B2Publication Date: 2012-05-08
- Inventor: Jorge Moraleda
- Applicant: Jorge Moraleda
- Applicant Address: JP Tokyo
- Assignee: Ricoh Co. Ltd
- Current Assignee: Ricoh Co. Ltd
- Current Assignee Address: JP Tokyo
- Agency: Patent Law Works LLP
- Main IPC: G06F7/00
- IPC: G06F7/00

Abstract:
The present invention relies on the two-dimensional information in documents and encodes two-dimensional structures into a one-dimensional synthetic language such that two-dimensional documents can be searched at text search speed. The system comprises: an indexing module, a retrieval module, an encoder, a quantization module, a retrieval engine and a control module coupled by a bus. A number of electronic documents are first indexed by the indexing module and stored as a synthetic text library. The retrieval module then converts and input image to synthetic text and searches for matches to the synthetic text in the synthetic text library. The matches can be in turn used to retrieve the corresponding electronic documents. It should be noted that a plurality of matches and corresponding electronic documents may be retrieves ranked by order according the similarity of the synthetic text. In one or more embodiments, the present invention includes a method for indexing documents by converting them to synthetic text, and a method for retrieving documents by converting an image to synthetic text and comparing the synthetic text to documents that have been converted to synthetic text for a match.
Public/Granted literature
- US20090018990A1 Retrieving Electronic Documents by Converting Them to Synthetic Text Public/Granted day:2009-01-15
Information query