Abstract:
PURPOSE: A method for storing and searching information based on a web base, and a system for managing of the same are provided to store extracted tupe and triple information to inverse-index structure extracted through high-quality language analysis such as triple/recognizing individual name and relation extraction thereby shortening a search time. CONSTITUTION: A language analysis block(100) performs language analysis of structure/non-structure. An object name recognition block(110) recognizes object name in the document. A triple storage block(130) stores information of extracted tupler type and extracted triple type by expanding reverse index structure. A query analysis block(140) extracts pattern of search information of tuple or triple type search information, after analysis a user query. A triple search block(150) performs search from the inverse-index structure.
Abstract:
본 발명은 MPEG-7로 표현된 멀티미디어 콘텐츠를 검색하기 위한 사용자 질의를 MPEG-7 질의 포맷으로 변환하여 멀티미디어 콘텐츠를 검색하는 방법 및 장치에 관한 것이다. 본 발명에 의한 멀티미디어 콘텐츠 검색 방법은 사용자의 질의를 MPEG-7 문서의 특정 영역을 지시하는 지시자와 상기 지시자를 참조하는 참조자를 이용하여 표현하는 단계와, 상기 지시자와 상기 참조자를 이용하여 표현된 상기 사용자의 질의의 의미를 해석하는 단계와, 상기 해석 결과에 따라 해당 멀티미디어 콘텐츠를 검색하는 단계를 포함한다. 이러한 본 발명에 의하면, MPEG-7 질의 포맷에서 2 이상의 검색 조건이 동일한 구조 내에서 모두 충족된다는 것이나, 또한 서로 다른 MPEG-7 문서를 참조하고 있다는 것을 명시적으로 표현할 수 있다. 또한, 검색 과정에서 사용자 질의의 의미가 정확하게 해석되므로 사용자 질의에 부합하는 멀티미디어 콘텐츠가 정확하게 검색될 수 있다. MPEG-7, 질의 포맷, 지시자, 참조자, 멀티미디어 콘텐츠
Abstract:
A method for searching for media information via natural language analysis is provided to offer a scheme for searching the media wanted by a user by making efficient analysis of a user's natural language query. A method for searching for media information via natural language analysis comprises the following several steps. If media information(101) is stored at a database, metadata is extracted from the inputted media information(103). The metadata matched with the media information is stored at a metadata index database(105). If natural language media search query information(111) is inputted, the inputted media search query is analyzed and a metadata analysis rule(113) is extracted. Then, the metadata analysis rule is stored at a metadata recognition rule database(115). If a user starts a media search operation(121), the natural language search query is recognized as metadata by using the metadata recognition rule database, and the media information matched with the recognized data is searched by using the metadata index database.
Abstract:
An apparatus and a method for generating a response sentence are provided to perform exact meaning analysis of a voice-recognized sentence by performing second point of sentence/substitutes extraction and second meaning analysis with respect to the voice-recognized sentence. A response sentence generating method comprises the following steps of: performing morpheme analysis of a voice-recognized sentence(200,210); extracting a first point of sentence from the sentence(220); performing first meaning analysis of the sentence based on the extracted first point of sentence(230); extracting a second point of sentence including the first point of sentence from the sentence based on the first meaning analysis result in order to further extract point of sentence which are not extracted in the above second step(240); generating a meaning analysis result of the voice-recognized sentence by performing second meaning analysis of the sentence based on the extracted second point of sentence(250); and generating a response sentence to the voice-recognized sentence based on the generated meaning analysis result(260).
Abstract:
A method and an apparatus for displaying information based on hierarchical classification are provided to supply a user visually with a variety of classification information configured in a hierarchical form as a basic structure and statistical information about mass media indexed under lately-issued ontology based semantic web environment, and browse the media information. A first classification reference corresponding to at least two pre-decided ontology semantic structure classes is selected(201). Information corresponding to each selected first classification reference is searched. The searched information is classified in correspondence with a second classification reference as an ontology semantic structure class lower than the first classification reference. A matrix where the at least two first classification references are respectively an axis and the second classification reference is a component of the axis is generated(207).
Abstract:
본 발명은 URL 포함 관계에 기반한 유사도 재계산을 통한 효과적인 홈페이지 검색방법에 관한 것이다. 본 발명은 같은 홈페이지에 속하는 웹 문서들의 URL들 간의 포함 관계를 이용하여 웹 문서들 중에서 그 홈페이지의 엔트리 포인트를 찾아내는 기술이다. 본 발명의 핵심은 어떤 문서의 URL이 다른 문서의 URL의 부분열(substring)이면 전자가 후자보다 홈페이지 즉 엔트리 포인트가 될 가능성이 높다는 성질을 이용한 것이다. 즉, 본 발명은 웹 검색에 있어서 종래 정보 검색 기법을 개선하여 홈페이지의 엔트리 포인트가 되는 페이지를 다른 문서들 보다 우선하여 검색되도록 함으로써, 사용자들이 검색된 웹 문서의 URL을 일일이 방문하지 않고도 검색된 웹 문서가 홈페이지인지 여부를 쉽게 알 수 있게 되는 이점이 있으며, 또한 사용자가 입력한 검색 질의가 포함하는 단어를 가지는 웹 문서들의 사이트 정보 즉 홈페이지를 우선적으로 검색하여 줌으로써 홈페이지를 통해서 더욱 많은 정보를 얻을 수 있게 되어 검색이 보다 편리해지는 이점이 있다.
Abstract:
A method and system for automatically indexing questions/answers based on the analysis of a language is provided to sort vocabularies and phrases through analyzing the language of various documents, automatically generate a corresponding natural language question, and store the question and corresponding answer in order to present good candidate of the answer corresponding to the question. A question/answer system includes an indexing engine(100) and a user question/answer engine(200). The indexing engine(100) includes a language analyzing unit(10), a candidate sentence selecting unit(20), a natural language question generating unit(30), and a question/answer indexing unit(40). The indexing engine(100) also includes an index database(42) and a question/answer database(44). The user question/answer engine(200) searches the indexed question/answer and answers the inquiry of the user. The user question/answer engine(200) includes a language analyzing unit(10), a question analyzing unit(50), a question detecting unit(60), and an outputting unit(70). The language analyzing unit(10) analyzes the language structure of the inputted various documents.
Abstract:
A method and device for automatically declaring pronunciation foreign language is provided to form a phoneme changing rule according to characteristic of the pronunciation of the Korean language, and declare the pronunciation of the foreign language using the phoneme changing rule. A phoneme changing rule is generated by analyzing statics data of the foreign language and a phoneme changing rule DB is constructed(S201). A weight of the phoneme changing rule is calculated and stored in the phoneme changing rule DB(S202). An inputted foreign language is divided into the phoneme(S203). A plurality of phoneme candidates are generated by referring the phoneme changing rule DB and applying phoneme changing rule into the phoneme rule(S204). The weight values corresponding to the phoneme candidates are calculated according to the phoneme changing rule(S205).
Abstract:
PURPOSE: A nonlinear quantization and similarity matching method for retrieving a video sequence having plural image frames is provided to configure a bit expression of an edge histogram descriptor having reduced bits for a video sequence including plural image sets, and to retrieve the video sequence with extracted information from the coded expression, thereby reducing the number of bits. CONSTITUTION: A system selects one image frame of a video sequence as a target image frame(S110), and divides the selected frame into sub images(S111). The system extracts edge histograms from the sub images(S112), and decides whether the edge histograms are generated for all the sub images(S113). If so, the system increases a constant by 1 to select a next image frame of the video sequence(S115). The system decides whether all image frames are selected from the video sequence(S116). If so, the system generates a representative edge histogram bin as the first image descriptor(S117), and creates a quantization index value group(S118).
Abstract:
PURPOSE: A non-linear quantization and similarity matching methods for retrieving image data is provided to construct a database to store image information representing a plurality of images with fewer bits, and to retrieve corresponding images in response to a query image based on a database with a high retrieval speed and accuracy. CONSTITUTION: L.times.5 number of normalized edge histogram bins are calculated to generate L number of edge histograms of a target image, wherein L is a positive integer and each edge histogram has five normalized edge histogram bins and represents a spatial distribution of five reference edges in a sub-image, wherein the reference edges include four directional edges and a non-directional edge(S101). The L.times.5 number of normalized edge histogram bins are non-linearly quantized to generate L.times.5 number of quantization index values for the target image(S103). The L.times.5 number of quantization index values are stored in the database(S105). And the steps S101 to S105 are repeated until all of the stored images are processed to construct the database having the image information(S107).