Abstract:
PURPOSE: A device and method for classifying a document of a single class category are provided to perform exact document classification by using an association rule extracted by an association rule detection method as a quality for document classification. CONSTITUTION: An associative rule training unit(100) generates matrices of qualities from a learning document set to generate an association rule candidate with a depth or widths primary search method. The associative rule training unit generates an associative rule training model from association rules candidates. A document class category classifier(150) uses an association rules learning model to classify a document of a document set.
Abstract:
PURPOSE: A device and a method for processing web information by extracting local information are provided to integrate various web information around related regional information to provide processed document data. CONSTITUTION: A major information extracting unit(150) extracts major information including regional information from document data according to a result of language analysis and a selected topic. A related information mapping unit(170) groups and maps the document data. An information integrating unit(180) compares the mapped document data. The information integrating unit integrates the document data according to the comparison result.
Abstract:
PURPOSE: A device and a method for keyword extraction and an associative word network configuration of document data are provided to extract automatically issue key word from a Blog document group and constitute an associative network in between extracted key words, thereby showing exact keyword according to each document. CONSTITUTION: An issue keyword extractor(104) parses structure information of a document in an inputted web document group. An issue keyword extractor extracts an issue keyword based on analyzed morpheme. An associative work network configurator(106) extracts relations between extracted issue keywords. An indexing unit(108) indexes extracted issue keywords and configured associated word network. According to a control command, a presentation unit(114) suggests the issue keyword and associated word network information.
Abstract:
PURPOSE: A topic map based indexing device, a topic map based searching device, a topic map based searching system and a method thereof are provided to obtain question analyzing information about question of a user and search similar questions in a community Q/A topic map according to question analyzing information and effectively outputs an answer, thereby searching most suitable answer. CONSTITUTION: A Q/A pre-processing block(102) normalizes the community Q/A list as monolithic. A Q/A analysis block(104) obtains Q/A analyzing information through analyze of the community Q/A list. A Q/A stores block stores indexing information through duplicated answer removal, meaningless answer removal, an answer list sorting, extracting answer of the top order and topic decision according to the Q/A analyzing information as community Q/A topic map.
Abstract:
PURPOSE: A method for storing and searching information based on a web base, and a system for managing of the same are provided to store extracted tupe and triple information to inverse-index structure extracted through high-quality language analysis such as triple/recognizing individual name and relation extraction thereby shortening a search time. CONSTITUTION: A language analysis block(100) performs language analysis of structure/non-structure. An object name recognition block(110) recognizes object name in the document. A triple storage block(130) stores information of extracted tupler type and extracted triple type by expanding reverse index structure. A query analysis block(140) extracts pattern of search information of tuple or triple type search information, after analysis a user query. A triple search block(150) performs search from the inverse-index structure.
Abstract:
본 발명은 MPEG-7로 표현된 멀티미디어 콘텐츠를 검색하기 위한 사용자 질의를 MPEG-7 질의 포맷으로 변환하여 멀티미디어 콘텐츠를 검색하는 방법 및 장치에 관한 것이다. 본 발명에 의한 멀티미디어 콘텐츠 검색 방법은 사용자의 질의를 MPEG-7 문서의 특정 영역을 지시하는 지시자와 상기 지시자를 참조하는 참조자를 이용하여 표현하는 단계와, 상기 지시자와 상기 참조자를 이용하여 표현된 상기 사용자의 질의의 의미를 해석하는 단계와, 상기 해석 결과에 따라 해당 멀티미디어 콘텐츠를 검색하는 단계를 포함한다. 이러한 본 발명에 의하면, MPEG-7 질의 포맷에서 2 이상의 검색 조건이 동일한 구조 내에서 모두 충족된다는 것이나, 또한 서로 다른 MPEG-7 문서를 참조하고 있다는 것을 명시적으로 표현할 수 있다. 또한, 검색 과정에서 사용자 질의의 의미가 정확하게 해석되므로 사용자 질의에 부합하는 멀티미디어 콘텐츠가 정확하게 검색될 수 있다. MPEG-7, 질의 포맷, 지시자, 참조자, 멀티미디어 콘텐츠
Abstract:
본 발명은 한국어 대화체 음성합성시스템에서 화맥(speech context) 정보를 이용하여 특정 형태에 대해 선택적으로 운율을 구현하는 방법에 관한 것이다. 본 발명은 합성시스템의 입력 문장 가운데, 형태가 같으면서 선택적으로 운율이 구현될 필요가 있는 단어나 어미 등에 대해 문장의 화행(speech act) 정보나 문형 정보를 포함하는 화맥 정보를 이용하여 태깅을 해 주고, 음성 합성시에 태깅된 특정 형태에 맞는 음편(speech segment)이 마킹된 합성단위 DB에서 해당 음편을 선택적으로 추출하여 대화 맥락 또는 문장의 유형에 맞는 운율을 다양하게 구현하는 방법을 제공한다. TTS, 대화체 음성합성시스템, 음성 합성, 대화체, 화맥정보, 운율