-
公开(公告)号:KR1020120064559A
公开(公告)日:2012-06-19
申请号:KR1020100125839
申请日:2010-12-09
Applicant: 한국전자통신연구원
CPC classification number: G06F17/30864 , G06F17/00 , G06F17/20 , G06F17/21 , G06F17/215 , G06F17/218 , G06F17/27 , G06F17/28 , G06F17/2881 , G06F17/30 , G06F17/30654 , G06F17/30657 , G06F19/00
Abstract: PURPOSE: An apparatus and a method for question analysis for open web question-answering are provided to pre-treat about external question, to analyze language, to recognize information of a question, to recognize the type of answer about the question, and extract keyword of the question. CONSTITUTION: Apparatus for question analysis for open web question-answering comprises: a query language analysis unit(110) conducting language analysis of provided question; a query recognition unit(120) recognizing at least one query type recognition result, address information, subject information and query type information from the language-analyzed question; a query answer type recognition unit(130) recognizing the answer type of the question; and a keyword recognition unit(140) extracting keyword from the question, and expanding the extracted keyword.
Abstract translation: 目的:提供一种用于开放式Web问答的问题分析的设备和方法,用于预处理外部问题,分析语言,识别问题的信息,识别问题的答案类型,并提取关键字 的问题。 构成:用于开放式网络问答的问题分析装置,包括:对提供的问题进行语言分析的查询语言分析单元(110) 从语言分析的问题中识别至少一个查询类型识别结果,地址信息,主题信息和查询类型信息的查询识别单元(120) 查询答案类型识别单元(130),识别问题的答案类型; 以及关键词识别单元(140)从所述问题中提取关键词,并扩展所提取的关键字。
-
公开(公告)号:KR1020120061133A
公开(公告)日:2012-06-13
申请号:KR1020100104145
申请日:2010-10-25
Applicant: 한국전자통신연구원
CPC classification number: G06F17/30657
Abstract: PURPOSE: A question type/domain recognition apparatus and a method thereof are provided to maximize the efficiency of character search for correct answer extraction by searching for an answer about user question in a local information restricting technique. CONSTITUTION: A question type recognition apparatus(100) recognizes the number of words and a question for information search. A question domain distributing unit(200) distributes a domain specified question/answer engine into a user question engine. A question/answer engine block(300) configures domain specified question/answer engines in response to distribution of the question domain distributing unit.
Abstract translation: 目的:提供一种问题类型/域识别装置及其方法,以通过在本地信息限制技术中搜索关于用户问题的答案来最大化正确答案提取的字符搜索的效率。 构成:问题类型识别装置(100)识别词的数量和信息搜索的问题。 问题域分发单元(200)将域指定的问题/答案引擎分发到用户问题引擎中。 问题/答案引擎块(300)根据问题域分发单元的分布配置域指定的问题/答案引擎。
-
公开(公告)号:KR1020120037841A
公开(公告)日:2012-04-20
申请号:KR1020100099546
申请日:2010-10-12
Applicant: 한국전자통신연구원
CPC classification number: G06F17/3087 , G06F17/30867 , G06Q30/0201 , H04L67/306
Abstract: PURPOSE: A personalized searching method of a mobile terminal and a mobile terminal thereof are provided to generate a preference analysis model based on usage record of a user and to apply the preference analysis model to an area search result. CONSTITUTION: A mobile terminal obtains location information of the mobile terminal and a query language from a user(210,220). The mobile terminal generates an area search result(230,240). The mobile terminal displays the area search result(250). The mobile terminal stores the usage record of the user(260). The mobile terminal generates a preference analysis model of the user.
Abstract translation: 目的:提供移动终端及其移动终端的个性化搜索方法,以基于用户的使用记录生成偏好分析模型,并将偏好分析模型应用于区域搜索结果。 构成:移动终端从用户获取移动终端的位置信息和查询语言(210,220)。 移动终端生成区域搜索结果(230,240)。 移动终端显示区域搜索结果(250)。 移动终端存储用户的使用记录(260)。 移动终端生成用户的偏好分析模型。
-
公开(公告)号:KR1020120036761A
公开(公告)日:2012-04-18
申请号:KR1020110101937
申请日:2011-10-06
Applicant: 한국전자통신연구원
Abstract: PURPOSE: A system and a method for providing a multimedia service in a communication system are provided to accurately search contract information about various multimedia services by transmitting and receiving search protocol and a message. CONSTITUTION: A generating unit(230) generates a query message which searches contract information in a data base and checks contract information from the query result message. The generating unit generates a response message including contract information. A transmitting unit(240) transmits the query message to a database and transmits the response message including the contract information to a terminal.
Abstract translation: 目的:提供一种在通信系统中提供多媒体业务的系统和方法,用于通过发送和接收搜索协议和消息来准确地搜索关于各种多媒体业务的合同信息。 构成:生成单元(230)生成搜索数据库中的合同信息并从查询结果消息检查合同信息的查询消息。 生成单元生成包括合同信息的响应消息。 发送单元(240)将查询消息发送到数据库,并将包括合同信息的响应消息发送到终端。
-
公开(公告)号:KR101095866B1
公开(公告)日:2011-12-21
申请号:KR1020080125436
申请日:2008-12-10
Applicant: 한국전자통신연구원
Abstract: 본 발명은 정보검색, 질의응답, 시맨틱 웹(semantic web) 등에서 튜플(tuple) 또는 트리플(triple) 형태의 정보를 인덱싱(indexing)하여 저장하고 검색하는 기술에 관한 것이다. 이를 위해 본 발명은, 인덱싱의 대상이 되는 문서로부터 형태소 분석, 명사구 청킹(chunking) 등을 수행하고, 언어분석 결과를 입력받아 사람, 장소, 조직 등의 개체명을 인식하며, 문장으로부터 주어와 목적어를 인식하고 주어와 목적어의 관계를 추출하고, 추출된 정보들을 트리플 인덱스에 인코딩하여 저장하며, 사용자의 키워드 또는 자연어질의를 분석하여 개체명 및 관계추출 정보를 분석하는 것을 특징으로 한다. 본 발명으로 인해, 시맨틱 웹 및 정보검색 분야에서 튜플 또는 트리플 형태의 정보를 효율적으로 저장하고 빠르게 검색할 수 있다.
정보검색, 질의응답, 시맨틱 웹, 온톨로지(Ontology), 튜플, 트리플, 색인저장 하부구조-
公开(公告)号:KR1020110064833A
公开(公告)日:2011-06-15
申请号:KR1020090121578
申请日:2009-12-09
Applicant: 한국전자통신연구원
CPC classification number: G06F17/30958
Abstract: PURPOSE: A knowledge graph purifying apparatus and method are provided to eliminate a duplicate entity and simplify the structure of a knowledge graph. CONSTITUTION: A knowledge graph generating unit(110) creates a knowledge graph. In the knowledge graph, the same entities are expressed as one node. An entity similarity calculator(120) calculates meaningful similarity between all possible entity pairs of the same entity type. A same entity integrating unit(130) selects a representative entity in the entity pair. The same entity integrating unit merges the rest entities to the representative entity.
Abstract translation: 目的:提供一种知识图提纯装置和方法,以消除重复的实体并简化知识图的结构。 构成:知识图生成单元(110)创建知识图。 在知识图中,相同的实体表示为一个节点。 实体相似度计算器(120)计算相同实体类型的所有可能实体对之间的有意义的相似性。 相同的实体集成单元(130)选择实体对中的代表实体。 同一实体整合单元将其余实体合并到代表实体。
-
公开(公告)号:KR1020110061229A
公开(公告)日:2011-06-09
申请号:KR1020090117819
申请日:2009-12-01
Applicant: 한국전자통신연구원
CPC classification number: G06F17/2735 , G06F17/278 , G06F17/30731
Abstract: PURPOSE: A system and method for semi-automatic construction of acronym dictionary is provided to improve the language analysis performance by semi-automatically finding an abbreviation word and an original word of the abbreviation word. CONSTITUTION: Some parts of an electronic document are extracted(S210). A proper noun which is included in the electronic document is acknowledged(S220). An abbreviation word about a proper noun candidate is extracted(S230). The abbreviation word candidates are listed in priority order(S240). An abbreviation word is selected(S250). The selected abbreviation word is stored into an abbreviation dictionary database.
Abstract translation: 目的:提供半自动构建首字母缩写词典的系统和方法,通过半自动找到缩写词的缩写词和原始词来提高语言分析性能。 规定:提取电子文档的某些部分(S210)。 电子文档中包含的专有名词被确认(S220)。 提取关于专名候选人的缩写词(S230)。 缩写词候选者以优先顺序列出(S240)。 选择缩写词(S250)。 所选择的缩写词被存储在缩写字典数据库中。
-
公开(公告)号:KR1020110040147A
公开(公告)日:2011-04-20
申请号:KR1020090097306
申请日:2009-10-13
Applicant: 한국전자통신연구원
CPC classification number: G06F17/30654
Abstract: PURPOSE: An apparatus for answering to a question based on answer trustworthiness and a method thereof are provided to evaluate the reliability for the correct answer candidates to a user query in score and reflect the scored reliability to the correct answer priority in order to provide a user with a reliable correct answer. CONSTITUTION: A correct answer indexing unit(20) indexes the documents of which document reliability satisfies a critical value, and stores the indexed documents in a knowledge storage unit. A correct answer candidate extraction unit(40) extracts the correct answer candidate documents for the user query from the knowledge storage unit. A correct answer source reliability measurement unit(53) analyzes a non-textual feature of the correct answer candidate documents in order to measure source reliability. A correct answer extraction strategy reliability measurement unit(55) analyzes the validity of the extraction strategy of the correct answer candidate documents to measure the extraction strategy reliability.
Abstract translation: 目的:提供一种基于应答可信赖性回答问题的方法及其方法,以评估用户查询的正确答案候选人的得分可靠性,并将得分可靠性反映到正确答案优先级,以提供用户 有一个可靠的正确答案。 构成:正确答案索引单元(20)对文件的可靠性满足临界值的文档进行索引,并将索引的文档存储在知识存储单元中。 正确答案候选提取单元(40)从知识存储单元提取用于用户查询的正确答案候选文档。 正确答案源可靠性测量单元(53)分析正确答案候选文件的非文本特征以便测量源可靠性。 正确答案提取策略可靠性测度单元(55)分析正确答案候选文件的提取策略的有效性,以测量提取策略的可靠性。
-
公开(公告)号:KR1020100068964A
公开(公告)日:2010-06-24
申请号:KR1020080127490
申请日:2008-12-15
Applicant: 한국전자통신연구원
IPC: G06F17/30
CPC classification number: G06F17/30867 , G06F17/3053 , G06F17/30536 , G06F17/30887
Abstract: PURPOSE: An apparatus for recommending related query and a method thereof are provided to use a click log of a search engine thereby suggesting related query language correlating with inputted initial query language. CONSTITUTION: An information extraction unit(112) extracts a plurality of each different query languages, URL(Uniform Resource Locator) information selected by the query languages and time information selected each URL reference to a click log. An index unit(116) calculates relation between the URL and the query language and generates click log index data. If the query language is inputted, a server controller(110) searches related URL and query language reference to the click log index data and classifies the related query language by a category and recommends a related query language having correlation with the query language relatively.
Abstract translation: 目的:提供一种用于推荐相关查询的装置及其方法,以使用搜索引擎的点击日志,从而建议与输入的初始查询语言相关的相关查询语言。 构成:信息提取单元(112)提取多个每个不同的查询语言,由查询语言选择的URL(统一资源定位符)信息和选择每个URL引用的时间信息到点击日志。 索引单元(116)计算URL和查询语言之间的关系,并生成点击日志索引数据。 如果输入了查询语言,则服务器控制器(110)搜索相关的URL和查询语言参考点击日志索引数据,并按类别对相关查询语言进行分类,并推荐与查询语言相关的相关查询语言。
-
公开(公告)号:KR1020100066920A
公开(公告)日:2010-06-18
申请号:KR1020080125438
申请日:2008-12-10
Applicant: 한국전자통신연구원
CPC classification number: G06F17/30011 , G06F17/30613 , G06F17/30705
Abstract: PURPOSE: An electronic document processing device and a method thereof are provided to determine duplicated document according to duplicate sentence rate of electronic document and reduce target electronic document effectively, thereby increasing efficiency of query response. CONSTITUTION: A sentence separation block(106) separates each sentence in extracted body content. A duplicated document decision block(108) changes the separated documents through hash algorithm to inherent hash value. According to collision between the changed hash value and pre-stored hash value, the duplicated document decision block determines duplicated sentence. The duplicated document decision block determines duplicated document according to duplicated document ratio of the electronic document.
Abstract translation: 目的:提供电子文件处理装置及其方法,根据电子文件的重复句子率确定重复的文件,有效减少目标电子文件,从而提高查询响应的效率。 构成:句子分离块(106)分离提取的身体内容中的每个句子。 复制文档决策块(108)通过散列算法将分离的文档改变为固有散列值。 根据改变的哈希值和预先存储的哈希值之间的冲突,复制的文档决定块确定重复的句子。 复制的文档决定块根据电子文档的重复文档比例确定重复的文档。
-
-
-
-
-
-
-
-
-