Abstract:
PURPOSE: A system and a method for automatically classifying documents are provided to accurately obtain information on a document classified by categories, and to search and provide information in which the user is interested. CONSTITUTION: A morpheme analyzer(103) receives documents collected and link titles and extracts related terms. A term clustering generator(101) receives the terms extracted from the morpheme analyzer(103) and extracts keywords by documents. In addition, the term clustering generator(101) generates lists of the keywords by documents and term clusters. A gene learning classification device(102) receives the lists and the term clusters generated from the term cluster generator(101). In addition, the gene learning classification device(102) extracts term clusters for the keywords and infers categories of related fields.
Abstract:
PURPOSE: A system and method for creating a three dimensional clustering are provided to link a word to a document horizontally and to classify a word and a document based on a directory by grafting the hierarchy terms concept in a clustering method based on the existing similarity. CONSTITUTION: A word extracting device(101) extracts actual contents of an HTML document using an HTML DTD(Document type definition), and extracts an actual word out of filtered contents using an HTML filter and a dictionary(104), and creates an index file(107) by embodying a weight value by words. A cluster creating device(102) extracts a word having a value more than the critical weight value out of words of the corresponding document, and creates a word group(cluster) based on the extracted each word, and compares the word group(cluster) with the existing word group(cluster dictionary:105) based on the corresponding word, and creates new word group(cluster dictionary:105) based on the corresponding word. A 3-D cluster processor(103) checks whether a query language of a user exists in a classification dictionary(106). If a query language of a user does not exist in a classification dictionary, a document value is outputted using a cluster corresponded to the query language and the index file(107) in the cluster dictionary(105). If a query language of a user exists in a classification dictionary, the upper and lower linking word in the classification dictionary(106), a query language and a cluster with respect to the upper and lower linking word in the cluster dictionary(105) based on the upper and lower linking word are searched, and a document value is outputted using the cluster dictionary(105).
Abstract:
PURPOSE: A system for creating and searching multi media data based on a XML and a method for creating multi media data using the system are provided to manufacture and search new-formed multi media data by unifying index information and multi media data based on a XML(extended markup language). CONSTITUTION: A description generator(110) receives a XML document structure for expressing multi media information and multi media data and describes multi media information by the XML document structure and inserts the information into the multi media data, and creates XML multi media data. A multi media data searching server(120) divides the XML multi media data into multi media information and search information and stores the information. The multi media data searching server(120) outputs information by performing a search if a search requesting message is received. A multi media data searching player(130) transmits the search requesting message to the multi media data searching server(120) and receives the XML multi media data from the multi media data searching server(120) and shows the data entirely or as the unit of a scene.