SYSTEM AND METHOD FOR AUTOMATIC PREPARATION AND SEARCHING OF SCANNED DOCUMENTS
    1.
    Invention application
    Status: under examination (published)

    Publication (announcement) number: WO0217166A3

    Publication (announcement) date: 2002-06-13

    Application number: PCT/IL0100797

    Filing date: 2001-08-24

    CPC classification number: G06K9/00 G06F17/30253

    Abstract: A system and a method for converting microfilm data into a digital format for publishing through a network such as the Internet. First, an image of the microfilm is created, preferably in the TIFF format. Next, the words of the image are recognized through a process of OCR (optical character recognition), with an associated probability of error. The image data can then be converted into a digital format for publication, for example as XML data. Preferably, the user is able to perform a keyword search on the digital format data. More preferably, the keyword search is an adaptive search.
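
    The steps in this abstract (OCR words with an error probability, XML output, keyword search) can be illustrated with a minimal sketch. The abstract does not define "adaptive search", so the rule below, which loosens the match threshold for words with a higher OCR error probability, is only an assumption made for illustration. The names OcrWord, to_xml and search, and the XML element names, are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from xml.etree import ElementTree as ET


@dataclass
class OcrWord:
    text: str
    error_prob: float  # estimated probability that the OCR reading is wrong


def to_xml(words):
    """Serialize recognized words, keeping the error probability as an attribute."""
    page = ET.Element("page")
    for w in words:
        el = ET.SubElement(page, "word", {"error": f"{w.error_prob:.2f}"})
        el.text = w.text
    return ET.tostring(page, encoding="unicode")


def search(xml_text, keyword, base_threshold=0.9):
    """Return stored words that match the keyword closely enough."""
    hits = []
    for el in ET.fromstring(xml_text).iter("word"):
        error = float(el.get("error"))
        # Assumed adaptation rule (not from the patent): the less reliable
        # the OCR reading, the lower the similarity required for a hit.
        threshold = base_threshold - 0.5 * error
        ratio = SequenceMatcher(None, keyword.lower(), el.text.lower()).ratio()
        if ratio >= threshold:
            hits.append(el.text)
    return hits


# "rnarket" stands for a typical OCR misreading of "market".
words = [OcrWord("Harbour", 0.02), OcrWord("rnarket", 0.40), OcrWord("closed", 0.05)]
xml_text = to_xml(words)
print(search(xml_text, "market"))
```

    Storing the error probability next to each word lets the search step decide how tolerant to be without re-running OCR.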


    SYSTEM AND METHOD FOR DATA PUBLICATION THROUGH WEB PAGES
    2.
    Invention application
    Status: under examination (published)

    Publication (announcement) number: WO0198948A3

    Publication (announcement) date: 2002-08-29

    Application number: PCT/US0114858

    Filing date: 2001-06-15

    CPC classification number: G06F17/3089 G06F17/211

    Abstract: A system and a method for publishing a newspaper page or other data through a Web page, such that the information can be made available more easily through a network such as the Internet. The data is automatically converted to the Web page format by first rendering the newspaper page into a digital format; converting the digital format to a basic internal publishing format; and then publishing the data in any one of a number of different possible publishing formats, including, but not limited to, a mark-up language document such as a Web page. The present invention supports such advanced features as arrangement of the content of the newspaper according to relationships within the information of the content and/or according to the preference(s) of the user, by analyzing the newspaper page as a plurality of objects. Each newspaper object may optionally be a title, an article, a picture and/or other graphic, an advertisement, and so forth.
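
    The idea of treating a newspaper page as a set of objects and then publishing them as a Web page can be sketched as follows. This is not the internal publishing format of the patent, which the abstract does not specify. The PageObject fields, the topic attribute and the ordering rule are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from html import escape


@dataclass
class PageObject:
    kind: str     # "title", "article", "picture" or "advertisement"
    content: str  # text, or an image path for pictures
    topic: str    # hypothetical field used to relate objects and match preferences


def to_html(objects, preferred_topic=None):
    """Render page objects as HTML, placing the user's preferred topic first."""
    # sorted() is stable, so objects keep their page order within each group.
    ordered = sorted(objects, key=lambda o: o.topic != preferred_topic)
    parts = []
    for o in ordered:
        if o.kind == "title":
            parts.append(f"<h1>{escape(o.content)}</h1>")
        elif o.kind == "picture":
            parts.append(f'<img src="{escape(o.content)}">')
        else:
            parts.append(f'<div class="{escape(o.kind)}">{escape(o.content)}</div>')
    return "\n".join(parts)


page = [
    PageObject("article", "Rates rise & fall", "finance"),
    PageObject("title", "Local team wins", "sport"),
]
html_text = to_html(page, preferred_topic="sport")
print(html_text)
```

    Because each object carries its own type and topic, the same list could be fed to a different renderer to produce another publishing format.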


    SYSTEM AND METHOD FOR DATA PUBLICATION THROUGH WEB PAGES
    4.
    Invention application
    Status: under examination (published)

    Publication (announcement) number: WO0198948A8

    Publication (announcement) date: 2003-11-20

    Application number: PCT/US0114858

    Filing date: 2001-06-15

    CPC classification number: G06F17/3089 G06F17/211

    Abstract: A system and a method for publishing a newspaper page or other data through a Web page, such that the information can be made available more easily through a network such as the Internet. The data is automatically converted to the Web page format by first rendering the newspaper page into a digital format; converting the digital format to a basic internal publishing format; and then publishing the data in any one of a number of different possible publishing formats, including, but not limited to, a mark-up language document such as a Web page. The present invention supports such advanced features as arrangement of the content of the newspaper according to relationships within the information of the content and/or according to the preference(s) of the user, by analyzing the newspaper page as a plurality of objects. Each newspaper object may optionally be a title, an article, a picture and/or other graphic, an advertisement, and so forth.


    SYSTEM AND METHOD FOR AUTOMATIC PREPARATION OF DATA REPOSITORIES FROM MICROFILM-TYPE MATERIALS

    Publication (announcement) number: AU2003269468A1

    Publication (announcement) date: 2004-05-04

    Application number: AU2003269468

    Filing date: 2003-10-12

    Abstract: A system and a method for the conversion of archived documents to a digital format and storage of the data extracted in repositories which may be easily extracted and searched by a user over a network such as the Internet. The data is preferably stored in the form of microfilm, although optionally the present invention could be operative with other types of physical media, such as microfiche, paper and any type of printed material. The microfilm data is preferably divided and/or grouped into at least one file. Optionally and preferably, each file undergoes the following automatic processing stages: combining files; analyzing image layout; segmentation; OCR; optional segmentation improvement; and output to XML, or another suitable output data format and/or language. In the last stage, the data contained in the files is preferably extracted and then more preferably transmitted to the relevant repository unit.
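
    The processing stages listed in this abstract can be laid out as a simple pipeline. The sketch below only shows the order of the stages. Every stage body is a stub, plain strings stand in for scanned frames, and all function names and the XML layout are hypothetical rather than taken from the patent.

```python
from xml.etree import ElementTree as ET


def combine_files(files):
    # Stage 1: merge the frames of several files into one sequence.
    return [frame for f in files for frame in f]


def analyze_layout(frames):
    # Stage 2 (stub): a real system would detect columns and blocks.
    return [{"frame": fr, "columns": 1} for fr in frames]


def segment(layouts):
    # Stage 3 (stub): one segment per frame.
    return [{"id": i, "image": lay["frame"]} for i, lay in enumerate(layouts)]


def run_ocr(segments):
    # Stage 4 (stub): the "image" is already text in this sketch.
    return [{**s, "text": s["image"]} for s in segments]


def improve_segmentation(segments):
    # Stage 5 (optional, stub): drop segments in which OCR found no text.
    return [s for s in segments if s["text"].strip()]


def to_xml(segments):
    # Stage 6: output to XML for the repository.
    root = ET.Element("document")
    for s in segments:
        ET.SubElement(root, "segment", {"id": str(s["id"])}).text = s["text"]
    return ET.tostring(root, encoding="unicode")


def process(files, improve=True):
    data = run_ocr(segment(analyze_layout(combine_files(files))))
    if improve:
        data = improve_segmentation(data)
    return to_xml(data)


print(process([["Front page text", ""], ["Page two text"]]))
```

    Keeping each stage as a separate function makes the optional segmentation-improvement step easy to switch on or off, as the improve flag shows.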

    10.
    Invention patent
    Title unknown

    Publication (announcement) number: AT451660T

    Publication (announcement) date: 2009-12-15

    Application number: AT03751248

    Filing date: 2003-10-12

    Abstract: A system and a method for the conversion of archived documents to a digital format and storage of the data extracted in repositories which may be easily extracted and searched by a user over a network such as the Internet. The data is preferably stored in the form of microfilm, although optionally the present invention could be operative with other types of physical media, such as microfiche, paper and any type of printed material. The microfilm data is preferably divided and/or grouped into at least one file. Optionally and preferably, each file undergoes the following automatic processing stages: combining files; analyzing image layout; segmentation; OCR; optional segmentation improvement; and output to XML, or another suitable output data format and/or language. In the last stage, the data contained in the files is preferably extracted and then more preferably transmitted to the relevant repository unit.
