-
公开(公告)号:KR1020140049149A
公开(公告)日:2014-04-25
申请号:KR1020120114534
申请日:2012-10-16
Applicant: 한국전자통신연구원
CPC classification number: G06F17/2705 , G06F17/2809
Abstract: The present invention relates to a device and a method for creating translation for a complex sentence. The translation creating device proposed by the present invention comprises a corpus analysis unit which classifies the original included in a corpus and an initial translation as a sentence unit and analyzes the sentence unit; a short sentence information extraction unit which extracts information about a predicate in each short sentence of the complex sentence included in the original based on the analyzed result of the original and extracts information about grammar and word class in each sentence of the complex sentence included in the initial translation based on the analyzed result of the initial translation; and a final translation creating unit which generates a final translation from the initial translation by using the information about the predicate and the information about the grammar and the word class. The present invention improves translation performance of the complex sentence. [Reference numerals] (510) Morpheme analysis unit; (520) Construction analysis unit; (531) Short sentence qualification extraction unit; (532) Optimal pattern selecting unit; (533) Complex sentence translation generation unit; (540) Short sentence unit translation generation unit; (550) Morpheme generation unit; (AA) Primitive language input sentence; (BB) Object language translation sentence
Abstract translation: 本发明涉及一种用于创建复杂句子的翻译的装置和方法。 本发明提出的翻译创建装置包括:语料库分析单元,其将包含在语料库中的原件和初始翻译分类为句子单元,并分析句子单元; 一种短句子信息提取单元,其基于原始的分析结果提取包含在原文中的复合句子的每个短句中的谓词的信息,并提取关于包含在原文中的复合句的每个句子中的语法和单词类的信息 基于初始翻译分析结果的初始翻译; 以及最终翻译创建单元,其通过使用关于谓词的信息和关于语法和单词类的信息从初始翻译生成最终翻译。 本发明提高了复合句的翻译性能。 (附图标记)(510)语素分析单元; (520)施工分析单位; (531)短句资格提取单位; (532)最优模式选择单元; (533)复杂句子翻译生成单元; (540)短句单位翻译生成单元; (550)语素生成单位; (AA)原语言输入句; (BB)对象语言翻译句子
-
公开(公告)号:KR1020130135442A
公开(公告)日:2013-12-11
申请号:KR1020120058995
申请日:2012-06-01
Applicant: 한국전자통신연구원
CPC classification number: G06F17/2809 , G06F17/271 , G06F17/2755 , G06F17/2785
Abstract: The present invention relates to a multilingual common conversion-based automatic translation device and a method capable of enabling bidirectional automatic multilingual translation for various multilingual language pairs by overcoming a limit on conversion technology targeting a uni-directional automatic multilingual translation for single language pairs applied to pattern-based automatic translation methodology. The automatic translation device according to the invention includes a multilingual analyzing part analyzing at least one among morphemes, constructions, and meanings about an inputted source language, a common converting part applying a format set based on a common concept system mutually used for both the source language and a target language to an analysis result of the multilingual analyzing part, a specializing part applying specialized knowledge about the usage of the language different from the system to the analysis result of the multilingual analyzing part, and a multilingual generating part generating the target language by using the output of the common converting part and the specializing part. [Reference numerals] (101) Multilingual analyzing part;(102) Common converting part;(103) Specializing part;(104) Multilingual generating part;(AA) Korean - English;(BB) Korean - Chinese
Abstract translation: 本发明涉及基于多语言共同转换的自动翻译装置和一种能够通过克服针对应用于单一语言对的单一语言对的单向自动多语言翻译的转换技术的限制来实现各种多语言对的双向自动多语言翻译的方法 基于模式的自动翻译方法。 根据本发明的自动翻译装置包括多语言分析部分,其分析关于所输入的源语言的语素,结构和含义中的至少一个,公共转换部分,基于相互用于源的共同概念系统应用格式集 多语言分析部分的分析结果的语言和目标语言,专门部分将与系统不同的语言的使用的专业知识应用于多语言分析部分的分析结果,以及生成目标语言的多语言生成部分 通过使用公共转换部分和专业部分的输出。 (101)多语言分析部分;(102)通用转换部分;(103)专业部分;(104)多语言生成部分;(AA)韩文 - 英文;(BB)韩文 - 中文
-
公开(公告)号:KR101333485B1
公开(公告)日:2013-11-28
申请号:KR1020100104288
申请日:2010-10-25
Applicant: 한국전자통신연구원
Abstract: 온라인 사전을 이용한 개체명 사전 구축 방법 및 이를 실행하는 장치가 개시된다. 먼저, 온라인 사전을 이용하여 개체명 분류 학습 데이터를 생성하고, 생성된 개체명 분류 학습 데이터에 대해 기계 학습을 통하여 개체명 분류 모델을 생성한 후, 생성된 개체명 분류 모델을 이용하여 온라인 사전의 표제어에 대한 개체명을 분류하고, 분류된 개체명을 개체명 사전에 추가한다. 따라서, 방대한 양의 개체명 사전을 정확하고 용이하게 구축할 수 있다.
-
公开(公告)号:KR101309839B1
公开(公告)日:2013-09-23
申请号:KR1020090118298
申请日:2009-12-02
Applicant: 한국전자통신연구원
Abstract: 본 발명은 통계 정보를 이용한 규칙 기반 구문 분석 장치 및 방법에 관한 것으로, 본 발명의 일실시 예에 따른 통계정보를 이용한 규칙 기반 구문분석 방법은, 입력 문장에 대해 구문 규칙을 적용함으로써 구문 분석을 수행하는 단계; 상기 입력 문장에 대해 적용되는 규칙에 주어진 규칙 확률과 어휘통계정보에 기반하여 계산된 어휘 의존 가중치를 이용하여 상기 규칙에 대한 규칙 가중치를 계산하는 단계; 각 구문트리에 사용된 규칙에 대해 계산된 상기 규칙 가중치들을 곱하여 각 구문트리의 가중치를 계산하고 가장 높은 가중치를 갖는 구문 트리를 선택하는 단계; 및 상기 선택된 구문 트리를 출력하는 단계를 포함한다.
상술한 바와 같은 본 발명은, 규칙기반 방식의 효율성과 통계기반 방식의 높은 모호성 처리 성능을 갖는 구문분석이 가능하다.
언어 처리, 구문 분석, 통계 정보, 규칙 기반-
公开(公告)号:KR101295642B1
公开(公告)日:2013-08-13
申请号:KR1020090120657
申请日:2009-12-07
Applicant: 한국전자통신연구원
Abstract: 본 발명은 음성인식결과 문장에 대한 문형분류장치 및 방법에 관한 것으로, 화자의 발성에 의해 입력된 음성을 인식하고, 음성 인식 결과에 의한 텍스트 문장을 대상으로 형태소를 분석하여, 형태소 분석 결과로부터 해당 문장의 문형을 분류하고, 문형 분류 결과에 따라 해당 문장의 말미에 문장 부호를 추가하는 의미분석모듈을 포함하며, 문장 부호가 추가된 문장을 타겟 언어로 번역하여 그에 대응하는 음성합성음을 출력한다. 본 발명에 따르면, 전체 자동통역 과정 중에서, 음성인식결과 및 이를 입력으로 하는 자동번역에 이르는 과정에서 음성인식결과에 의한 텍스트 문장의 문형을 분류하기 위한 것으로, 음성인식결과의 문형을 식별해 줌으로써, 자동번역 입력문의 문형정보를 보다 정확히 제공하게 되며, 궁극적으로 소스언어를 타겟언어로 자동통역하는 것이 원활히 이루어지게 되는 효과가 있다.
-
公开(公告)号:KR1020130014894A
公开(公告)日:2013-02-12
申请号:KR1020110076621
申请日:2011-08-01
Applicant: 한국전자통신연구원
IPC: G10L15/14
Abstract: PURPOSE: A word meaning recognizing device using engram is provided to recognize words for inputted sentences by setting the meaning information of the words. CONSTITUTION: A meaning information data managing unit(100) divides sentence corpuses according to an engram unit. The meaning information data managing unit generates engram characteristic values. The meaning information data managing unit generates relative characteristic values of the engram unit according to meaning information. A meaning recognizing unit(300) adds a meaning tag to the word of the inputted sentence based on a comparing result between the meaning information and a relative characteristic value. [Reference numerals] (110) Meaning information establishing unit; (120) Data storage unit; (130) Overlapped data removing unit; (140) N-gram information generating unit; (150) N-gram analyzing unit; (200) Sentence input unit; (300) Meaning recognizing unit
Abstract translation: 目的:提供使用engram的识别装置的单词,用于通过设置单词的含义信息来识别输入句子的单词。 构成:意义信息数据管理单元(100)根据枚举单位划分句子语料库。 含义信息数据管理单元生成图形特征值。 意义信息数据管理单元根据含义信息生成图形单元的相对特征值。 意义识别单元(300)基于意义信息和相对特征值之间的比较结果将意义标签添加到输入的句子的单词。 (附图标记)(110)含义信息建立单元; (120)数据存储单元; (130)重叠数据去除单元; (140)N-gram信息生成单元; (150)N-gram分析单元; (200)句子输入单元; (300)含义识别单位
-
公开(公告)号:KR1020120089502A
公开(公告)日:2012-08-13
申请号:KR1020100125870
申请日:2010-12-09
Applicant: 한국전자통신연구원
CPC classification number: G06F17/289 , G06F17/2809 , G06F17/27 , G06F17/271 , G06F17/2755 , G06F17/28 , G06F17/2854
Abstract: PURPOSE: A translation knowledge server generating method and an apparatus thereof are provided to obtain a translation knowledge and to apply the obtained knowledge to a translation engine. CONSTITUTION: A data analysis unit(103) performs morphological analysis and syntax analysis of initial translation knowledge data collected from a data collecting unit. The data analysis unit outputs the analyzed data. A translation knowledge learning unit(105) determines a target word by domain according to predetermined domain information. The translation knowledge learning unit determines a domain through automatic leaning clustering. The translation knowledge learning unit learns translation knowledge in real time.
Abstract translation: 目的:提供翻译知识服务器生成方法及其装置,以获得翻译知识并将获得的知识应用于翻译引擎。 构成:数据分析单元(103)执行从数据收集单元收集的初始翻译知识数据的形态分析和语法分析。 数据分析单元输出分析数据。 翻译知识学习单元(105)根据预定域信息确定目标单词。 翻译知识学习单元通过自动倾斜聚类来确定域。 翻译知识学习单元实时学习翻译知识。
-
公开(公告)号:KR1020120088032A
公开(公告)日:2012-08-08
申请号:KR1020100101419
申请日:2010-10-18
Applicant: 한국전자통신연구원
IPC: G06F17/28
CPC classification number: G06F17/28 , G06F17/2818 , G06F17/2854 , G06F17/2872 , G06F17/289
Abstract: PURPOSE: An automatic real time translation knowledge extracting/verifying method and an apparatus thereof are provided to automatically build a translation knowledge of an automatic translation system and to reduce installation costs for building existing automatic translation knowledge dictionary. CONSTITUTION: A parallel corpus collecting unit(100) collects a parallel corpus on a web. The parallel corpus collecting unit removes a tag such as HTML. The parallel corpus collecting unit generates the parallel corpus. A translation knowledge extracting block(110) extracts predetermined translation knowledge. A translation knowledge evaluating block(130) evaluates and converts the extracted translation knowledge into a first evaluation element and a second evaluation element.
Abstract translation: 目的:提供自动实时翻译知识提取/验证方法及其装置,以自动构建自动翻译系统的翻译知识,并降低构建现有自动翻译知识词典的安装成本。 构成:平行语料库收集单元(100)在网上收集平行语料库。 平行语料库收集单元删除诸如HTML的标签。 平行语料库收集单元生成平行语料库。 翻译知识提取块(110)提取预定的翻译知识。 翻译知识评估块(130)评估并将所提取的翻译知识转换为第一评估元素和第二评估元素。
-
公开(公告)号:KR1020120052683A
公开(公告)日:2012-05-24
申请号:KR1020100113955
申请日:2010-11-16
Applicant: 한국전자통신연구원
IPC: G06Q50/00
CPC classification number: H04L67/306 , H04L67/18 , H04W4/021 , H04W4/023
Abstract: PURPOSE: A multilateral situation information sharing apparatus for an intelligent service and method thereof are provided to offer preference information to group users by analyzing situation information of multiple purposes user groups. CONSTITUTION: A user situation information management apparatus(260) collects situation information based on a sensor and preference information based on user actions including user log information of each terminal. The user situation information management apparatus identifies purpose user groups by receiving situation information collecting requests for a purpose user group. A user preference information analysis apparatus(250) calculates user preference information by analyzing preference information based on the user action and the situation information based on the sensor.
Abstract translation: 目的:提供一种用于智能服务的多边情报信息共享装置及其方法,通过分析多用途用户组的情况信息,为组用户提供偏好信息。 构成:用户情况信息管理装置(260)基于包括每个终端的用户日志信息的用户动作,基于传感器和偏好信息来收集情况信息。 用户情况信息管理装置通过接收对目的用户组的请求的情况信息来识别目的用户组。 用户偏好信息分析装置(250)通过基于用户动作和基于传感器的情况信息分析偏好信息来计算用户偏好信息。
-
公开(公告)号:KR1020110067276A
公开(公告)日:2011-06-22
申请号:KR1020090123802
申请日:2009-12-14
Applicant: 한국전자통신연구원
CPC classification number: G06F17/278 , G06F17/2755 , G06F17/2863
Abstract: PURPOSE: A Korean language analyzing apparatus is provided to improve the efficiency of analyzing a noun phrase structure by determining noun phrase analysis candidates on the basis of a noun vocabulary pattern and a verb phrase pattern. CONSTITUTION: The inflected word expression/vocabulary pattern building block [verb phrase/vocabulary pattern building block](102) uses the sentences of the analysis object domain. The inflected word expression pattern and vocabulary pattern are away driven. A verb phrase/vocabulary pattern building block builds verb phrase patterns and vocabulary patterns using the sentences in a domain to be analyzed. An analysis candidate generating block(104) generates noun phrase analysis candidates by searching for a noun that can be a declinable word from the noun phrase sentence to be analyzed. A priority determining block(108) produces a preference from the established vocabulary patterns and determines the priority of the noun phrase analysis candidates. An analysis candidate selecting block(110) determines on noun phrase analysis candidate in consideration of the preference and inter-noun restriction.
Abstract translation: 目的:提供韩语分析装置,以通过基于名词词汇模式和动词短语模式确定名词短语分析候选来提高分析名词短语结构的效率。 构成:词汇表达/词汇模式构建块[动词短语/词汇模式构建块](102)使用分析对象域的句子。 转折词表达模式和词汇模式被驱逐。 动词短语/词汇模式构建块使用要分析的域中的句子构建动词短语模式和词汇模式。 分析候选生成块(104)通过从要分析的名词短语句中搜索可以是可拒绝词的名词来生成名词短语分析候选。 优先级确定块(108)从所建立的词汇模式产生偏好,并确定名词短语分析候选者的优先级。 考虑到偏好和名词之间的限制,分析候选选择块(110)确定名词短语分析候选者。
-
-
-
-
-
-
-
-
-