Apparatus and Method for Processing HTML/SGML Tags for Natural Language Processing
    Published invention application
    Apparatus and Method for Processing HTML/SGML Tags for Natural Language Processing (status: invalid)

    Publication number: KR1020010018214A

    Publication date: 2001-03-05

    Application number: KR1019990034077

    Filing date: 1999-08-18

    Abstract: PURPOSE: An apparatus and method for processing HTML/SGML tags for natural language processing are provided. By distinguishing sentence-unit tags from word-unit tags, and by using user-defined tags for scripts and notes, they allow an original document to be recognized sentence by sentence and a corrected document to be generated without losing the tags of the original document. CONSTITUTION: A memory load device (2) loads data from disk into memory in response to an input HTML document (1). The HTML document (2) is loaded into memory by the device (2). A device (3) separates the tags from the HTML document in memory, consulting an HTML tag database (3a) during the separation. A memory (3b) stores the separated tags. A sentence recognition part (4) recognizes sentences in the text from which the tags have been separated. A part (4a) stores the sentence recognition result. A device (5) processes that result to produce a translation and a summary. A memory (5a) stores the processed result. A tag recovery part (6) couples the tags with the processed content by referring to the data in memory (5a) and memory (3b). A memory (6a) stores the recovered tags.
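
    The flow described in the abstract (tag separation, sentence recognition, processing, tag recovery) can be illustrated with a minimal Python sketch. It is not the patented implementation: the tag set standing in for the HTML tag database (3a), the regular expressions, the function names, and the rule for re-attaching word-unit tags by character offset are all assumptions made for illustration. The user-defined tags for scripts and notes are not modeled.

```python
import re

# Hypothetical stand-in for the HTML tag database (3a): tags treated as
# sentence-unit boundaries. Every other tag is treated as a word-unit tag.
SENTENCE_UNIT_TAGS = {"p", "br", "div", "li", "h1", "h2", "h3", "td", "title"}

# Simplified tag pattern (assumption): does not handle comments or scripts.
TAG_RE = re.compile(r"</?([A-Za-z][A-Za-z0-9]*)[^>]*>")


def separate_tags(html):
    """Device (3): strip tags from the text, remembering where each one was.

    Returns (text, tags); each tag is (offset_in_text, raw_tag, is_sentence_unit).
    """
    parts, tags, offset, pos = [], [], 0, 0
    for m in TAG_RE.finditer(html):
        chunk = html[pos:m.start()]
        parts.append(chunk)
        offset += len(chunk)
        is_sentence_unit = m.group(1).lower() in SENTENCE_UNIT_TAGS
        tags.append((offset, m.group(0), is_sentence_unit))
        pos = m.end()
    parts.append(html[pos:])
    return "".join(parts), tags


def recognize_sentences(text, tags):
    """Part (4): cut the tag-free text at sentence-unit tags and after . ! ?"""
    cuts = {0, len(text)}
    cuts.update(off for off, _, is_sentence_unit in tags if is_sentence_unit)
    cuts.update(m.end() for m in re.finditer(r"[.!?](?=\s|$)", text))
    ordered = sorted(cuts)
    return list(zip(ordered, ordered[1:]))


def recover_tags(text, tags, spans, process):
    """Device (5) and part (6): process each sentence, then re-attach tags.

    Tags are re-inserted at their original offset inside the sentence,
    clamped to the processed length. This is exact only when `process`
    preserves length; a real translator would need an alignment step.
    """
    out, i = [], 0
    for start, end in spans:
        result = process(text[start:end])
        cursor = 0
        while i < len(tags) and tags[i][0] < end:
            rel = min(tags[i][0] - start, len(result))
            out.append(result[cursor:rel])
            out.append(tags[i][1])
            cursor = rel
            i += 1
        out.append(result[cursor:])
    out.extend(raw for _, raw, _ in tags[i:])  # tags after the last sentence
    return "".join(out)


def run_pipeline(html, process):
    """Chain the three steps; `process` stands in for translation or summary."""
    text, tags = separate_tags(html)
    spans = recognize_sentences(text, tags)
    return recover_tags(text, tags, spans, process)


sample = "<p>Hello <b>world</b>. How are you?</p>"
print(run_pipeline(sample, str.upper))
# prints: <p>HELLO <b>WORLD</b>. HOW ARE YOU?</p>
```

    With an identity function as `process`, the sketch reproduces the input unchanged, which mirrors the abstract's goal of generating the output document without losing any tag of the original.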

