TEXT SEGMENTATION AND LABEL ASSIGNMENT WITH USER INTERACTION BY MEANS OF TOPIC SPECIFIC LANGUAGE MODELS AND TOPIC-SPECIFIC LABEL STATISTICS
    Invention application
    Status: Under examination (published)

    Publication No.: WO2005050474A3

    Publication date: 2006-07-13

    Application No.: PCT/IB2004052405

    Filing date: 2004-11-12

    CPC classification number: G06F17/21 G06F17/27 G06F17/2765

    Abstract: The invention relates to a method, a computer program product, a segmentation system and a user interface for structuring unstructured text using statistical models trained on annotated training data. The method segments the text into sections and assigns a label to each section as its heading. The resulting segmentation and label assignment are presented to a user for review. Alternative segmentations and label assignments are also offered, so that the user can select an alternative segmentation or label, or enter a user-defined segmentation and a user-defined label. In response to the user's modifications, a number of different actions are initiated, including re-segmentation and re-labelling of successive parts of the document or of the entire document. The method further comprises a learning functionality that logs and analyzes the user's modifications in order to adapt the method to the user's preferences and to further train the statistical models.
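
The segmentation and labelling idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: it assumes add-one smoothed unigram models per topic, a fixed topic-switch penalty, and a Viterbi search over sentences. All names (`train_topic_models`, `segment`, `alternative_labels`, `switch_penalty`) are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

def train_topic_models(annotated):
    """Build one word-count model per label from (label, text) pairs."""
    models = {}
    for label, text in annotated:
        models.setdefault(label, Counter()).update(text.lower().split())
    return models

def _vocab_size(models):
    # Distinct words over all topics, plus one slot for unseen words.
    return len({w for counts in models.values() for w in counts}) + 1

def log_prob(sentence, counts, vocab_size):
    """Add-one smoothed unigram log-likelihood of a sentence under one topic."""
    total = sum(counts.values())
    return sum(math.log((counts[w] + 1) / (total + vocab_size))
               for w in sentence.lower().split())

def segment(sentences, models, switch_penalty=2.0):
    """Viterbi search over topics; returns a list of (label, sentences) sections."""
    if not sentences:
        return []
    labels = sorted(models)
    n = len(labels)
    v = _vocab_size(models)
    emit = [[log_prob(s, models[l], v) for l in labels] for s in sentences]
    best, back = [emit[0][:]], []
    for t in range(1, len(sentences)):
        row, ptr = [], []
        for j in range(n):
            # Staying in the same topic is free; switching costs a penalty.
            cands = [best[-1][i] - (0.0 if i == j else switch_penalty)
                     for i in range(n)]
            i_best = max(range(n), key=cands.__getitem__)
            row.append(cands[i_best] + emit[t][j])
            ptr.append(i_best)
        best.append(row)
        back.append(ptr)
    # Trace the best topic sequence backwards.
    j = max(range(n), key=best[-1].__getitem__)
    path = [j]
    for ptr in reversed(back):
        j = ptr[j]
        path.append(j)
    path.reverse()
    # Merge runs of the same topic into labelled sections.
    sections = []
    for s, j in zip(sentences, path):
        if sections and sections[-1][0] == labels[j]:
            sections[-1][1].append(s)
        else:
            sections.append((labels[j], [s]))
    return sections

def alternative_labels(section_sentences, models):
    """Rank all labels for one section, best first, to offer as alternatives."""
    v = _vocab_size(models)
    text = " ".join(section_sentences)
    return sorted(models, key=lambda l: log_prob(text, models[l], v), reverse=True)
```

In this sketch, a user correction could be handled by fixing the label of the corrected section and calling `segment` again on the sentences that follow it; logging such corrections and adding them to the `annotated` training pairs would be one simple way to approximate the learning functionality the abstract mentions.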
