TEXT COMPRESSION AND EXPANSION METHOD AND APPARATUS

    公开(公告)号:AU596713B2

    公开(公告)日:1990-05-10

    申请号:AU8163787

    申请日:1987-11-24

    Applicant: IBM

    Abstract: A text compression method and apparatus are disclosed that enable overall compression ratios of more than six or eight to one for normal language text. Plural multiple-word dictionaries that are specialized for the particular field of use are employed together with a header transmission format that identifies which dictionaries are to be used. In addition, entries in these dictionaries are categorized by a weighted frequency of use ranking in which the product of the word length in characters and the frequency of occurrence of that word in the text is taken as the weighted figure of merit for ranking words to be placed in the individual dictionaries.

    TEXT COMPRESSION AND EXPANSION METHOD AND APPARATUS

    公开(公告)号:AU8163787A

    公开(公告)日:1988-06-09

    申请号:AU8163787

    申请日:1987-11-24

    Applicant: IBM

    Abstract: A text compression method and apparatus are disclosed that enable overall compression ratios of more than six or eight to one for normal language text. Plural multiple-word dictionaries that are specialized for the particular field of use are employed together with a header transmission format that identifies which dictionaries are to be used. In addition, entries in these dictionaries are categorized by a weighted frequency of use ranking in which the product of the word length in characters and the frequency of occurrence of that word in the text is taken as the weighted figure of merit for ranking words to be placed in the individual dictionaries.

    13.
    发明专利
    未知

    公开(公告)号:NO875048L

    公开(公告)日:1988-06-06

    申请号:NO875048

    申请日:1987-12-03

    Applicant: IBM

    Abstract: A text compression method and apparatus are disclosed that enable overall compression ratios of more than six or eight to one for normal language text. Plural multiple-word dictionaries that are specialized for the particular field of use are employed together with a header transmission format that identifies which dictionaries are to be used. In addition, entries in these dictionaries are categorized by a weighted frequency of use ranking in which the product of the word length in characters and the frequency of occurrence of that word in the text is taken as the weighted figure of merit for ranking words to be placed in the individual dictionaries.

Patent Agency Ranking