Automatic method of extracting summarization using feature probabilities
    1.
    发明公开
    Automatic method of extracting summarization using feature probabilities 失效
    Automatische Methode zur Extraktionszusammenfassung durch Gebrauch von Merkmal-Wahrscheinlichkeiten

    公开(公告)号:EP0751469A1

    公开(公告)日:1997-01-02

    申请号:EP96304777.4

    申请日:1996-06-28

    CPC classification number: G06F17/30719

    Abstract: A method of automatically generating document extracts. The method makes use of feature value probabilities generated from a statistical analysis of manually generated summaries to extract the same set of sentences an expert might. The method is based upon an iterative approach. First, the computer system designates a sentence of the document as a selected sentence. Second, the computer system determine values for the selected sentence of each feature of a feature set. Third, the computer system increases a score for the selected sentence based upon the value of the feature for the selected sentence and upon the probability associated with that value. Fourth, after scoring all of the sentences of the document the computer system, the computer system selects a subset of the highest scoring sentences to be extracted.

    Abstract translation: 自动生成文档提取的方法。 该方法利用从手动生成的摘要的统计分析生成的特征值概率来提取专家可能的同一组句子。 该方法基于迭代方法。 首先,计算机系统将文档的句子指定为所选择的句子。 第二,计算机系统确定特征集的每个特征的所选择的句子的值。 第三,计算机系统基于所选择的句子的特征值以及与该值相关联的概率来增加所选句子的得分。 第四,在对计算机系统的文档的所有句子进行评分之后,计算机系统选择要提取的最高得分句子的子集。

    Automatic method of generating feature probabilities for automatic extracting summarization
    4.
    发明公开
    Automatic method of generating feature probabilities for automatic extracting summarization 失效
    生成用于自动提取摘要功能概率的自动方法

    公开(公告)号:EP0751470A1

    公开(公告)日:1997-01-02

    申请号:EP96304778.2

    申请日:1996-06-28

    CPC classification number: G06F17/30719

    Abstract: A method of automatically generating feature probabilities that allow later automatic generation of document extracts. The computer system generates the probabilities by analyzing each document a document at a time. First, the computer system designates one of the documents as a selected document. Next, the computer system analyzes each sentence of the selected document to determine the value of the paragraph feature and the value of the uppercase feature. The computer system repeats this effort for each document of the document corpus. Afterward, the number of occurrences of each value of each feature is calculated and is used to calculate feature value probabilities for all of the features.

    Abstract translation: 自动生成特征的概率的方法确实允许后自动生成文件提取物。 计算机系统基因利率同时分析每个文档的文档的概率。 首先,计算机系统指定文档作为一个选择的文档中的一个。 接着,计算机系统所选择的文档的每个句子分析,以确定矿段落特征的值和上壳体特征的值。 计算机系统重复这种努力的文档语料库的每个文档。 此后,每个特征的每个值的出现的次数被计算并用于计算特征值的概率的所有的特征。

    Automatic method of selecting multi-word key phrases from a document
    5.
    发明公开
    Automatic method of selecting multi-word key phrases from a document 失效
    对于选择自动方法从文档中包含几个单词,关键短语

    公开(公告)号:EP0741364A1

    公开(公告)日:1996-11-06

    申请号:EP96303094.5

    申请日:1996-05-01

    CPC classification number: G06F17/30616

    Abstract: An automatic method of generating key phrases for a machine readable document. The method begins by breaking (42) the text of the document into multi-word phrases free of stop words which begin and end acceptably. Then the most frequent phrases are selected (43-58) as key word phrases.

    Abstract translation: 生成关键短语用于机器可读文件的自动方法。 该方法开始打破(42)将文档分成多字的文本短语不含可开始和结束可以接受停用词。 然后最频繁的短语选择(43-58)的关键字短语。

Patent Agency Ranking