Abstract:
PURPOSE: A system and a method for guiding a standard sentence pattern are provided to guide a user in a standard sentence pattern in real time by consulting the standard Korean sentence pattern. CONSTITUTION: The system includes an input unit(110) through which a user inputs a sentence, a morpheme analyzing unit(120) to divide the inputted sentence into the syllables and to analyze the morphemes composing the syllables, a vocabulary information extracting unit(130) to extract the vocabulary information of the analyzed sentence, a standard sentence pattern rule determining unit(140) to determine standard sentence pattern rules and apply the rules, and an output unit(170) to output a standardized sentence pattern of the inputted sentence depending on the applied standard sentence pattern rules. The standard sentence pattern rule determining unit applies the most proper one among the Korean standard sentence pattern rules loaded and pre-stored in the system. A standard sentence pattern is verified and deducted by using the abbreviation deduction patterns sought by an abbreviation deduction pattern search unit(150).
Abstract:
본 발명은 문서에서 이벤트 문장을 추출하는 장치 및 그 방법에 관한 것이다. 본 발명은 언어처리부(10)에서 입력 문서집합에 대해 형태소 분석 및 개체명 인식을 수행하고, 문서집합 학습부(20)에서 학습용 문서들을 언어처리한 결과를 이용해 동사, 명사 및 명사구 자질을 추출하고 각각에 대한 가중치를 계산함으로써 중요 자질을 선택해 데이터베이스에 저장하며, 이벤트 문장 추출부(30)에서 언어처리부(10)가 추출용 문서를 언어처리한 결과와 문서집합 학습부(20)가 학습한 결과를 비교 분석함으로써 추출용 문서 내의 각 문장에 대한 가중치를 계산하고 추출 조건에 따라 이벤트 문장을 추출하도록 되어 있으며, 이에 따라서, 문서로부터 도메인 의존적인 정보를 함축하고 있는 유용한 자료들을 선별하여 손쉽게 획득할 수 있다.
Abstract:
PURPOSE: A device and a method for recognizing an object name on a Korean text are provided to recognize the object name on the Korean text through the reinforced learning using a co-training based on the HMM(Hidden Markov Model). CONSTITUTION: A morpheme parser(10) separates the input text into a list of sentences, forms a morpheme list separating each sentence into the morpheme unit tagging a state label, generates/stores an HMM data structure in a memory. A statistics information extractor(20) extracts the HMM object name statistics information from an object name tagged text set. A co-training learning device(40) extends the statistics information through the current learning data by extracting the HMM object name statistics information from an unlabeled text set as advancing the co-training learning based on the HMM statistics information extracted from the object name tagged text set. An object name recognizer(50) recognizes the object name by deciding an optimal HMM object name statistics information path of the morphemes forming the sentences of the input text through a viterbi algorithm.
Abstract:
PURPOSE: An apparatus and method for storing/restoring SGML/XML entity is provided to prevent a waste of an auxiliary memory and to store SGML/XML document more rapidly and to restore entities stored in a database in various forms into the original entities in accordance with the form of each entity by sharing an entity declaration being referred to a plurality of documents. CONSTITUTION: A user interface(100) comprises a document input interface(110) requesting SGML/XML DTD and an input of a document to a SGML/XML document managing system(200), and a document output interface(120) requests a restore of a document by transmitting a storing portion identifier to the SGML/XML document managing system(200) and presents data of the document to a user. The elements of the SGML/XML document managing system(200) are described as follows. A SGML/XML parser(210) provides a verification of an error and a parsing result to a main memory as a tree form by performing a parsing with respect to the SGML/XML DTD and the document. A SGML/XML document storing device(220) receives the parsing result and stores an entity referring portion included in the document. An entity storing device(230) stores an entity declaration of the SGML/XML DTD. A SGML/XML entity managing device(240) receives the storing portion identifier from a client and returns data of an entity object corresponded to the storing portion identifier. A database system interface(250) is provided to interwork to a database system(300). The database system(300) charges a role of the lower storing system. A SGML/XML database(400) stores the document and the entity.
Abstract:
본 발명은 비구조 문서에서 사용자가 요구하는 정보를 추출하는 장치 및 그 방법에 관한 것이다. 본 발명은 사용자가 추출정보 명세부(10)에 추출하고자 하는 정보를 지정하여 입력하면, 이벤트 템플릿 추출부(20)가 입력된 정보추출용 문서(21)에 추출정보 명세부(10)에 지정된 정보가 포함되어 있는가를 판별하여 특정한 논항구조로 된 문장단위의 이벤트 템플릿을 추출하고, 이벤트 템플릿 통합부(30)가 이벤트 템플릿들을 논항구조와 그 내용의 일치 여부에 따라서 서로 통합한 후, 템플릿 추출부(40)가 통합된 이벤트 템플릿들 중에서 사용자가 추출하고자 하는 정보만을 보유한 템플릿을 추출하여 데이터 베이스(41)에 저장하도록 되어 있으며, 이에 따라서, 인터넷이나 회사에서 보유하고 있는 문서의 구조를 알 수 없는 일반 한국어 문서에서 특정 영역의 정보 구축을 최소화하면서 원하는 정보를 용이하게 추출할 수 있으며, 특히 사용자가 접근할 수 있는 정보의 양을 넓히면서 원하는 정보에 접근하는 시간을 줄 일 수 있다.
Abstract:
PURPOSE: A system and a method for guiding a standard sentence pattern are provided to guide a user in a standard sentence pattern in real time by consulting the standard Korean sentence pattern. CONSTITUTION: The system includes an input unit(110) through which a user inputs a sentence, a morpheme analyzing unit(120) to divide the inputted sentence into the syllables and to analyze the morphemes composing the syllables, a vocabulary information extracting unit(130) to extract the vocabulary information of the analyzed sentence, a standard sentence pattern rule determining unit(140) to determine standard sentence pattern rules and apply the rules, and an output unit(170) to output a standardized sentence pattern of the inputted sentence depending on the applied standard sentence pattern rules. The standard sentence pattern rule determining unit applies the most proper one among the Korean standard sentence pattern rules loaded and pre-stored in the system. A standard sentence pattern is verified and deducted by using the abbreviation deduction patterns sought by an abbreviation deduction pattern search unit(150).
Abstract:
PURPOSE: A device for servicing custom-made one-stop information of a mobile user and a method therefor are provided to share various services by using a user-demand technology, and to supply one integrated service, thereby dynamically generating various complex services and contents according to user taste. CONSTITUTION: A mobile terminal(100) receives an OSS(One-Stop Service) by accessing a mobile web application server. A screen processing module(210) consists of java server page and java servlet for the OSS generated in mobile web application server page type. A service processor(220) generates an OSS log, and displays a corresponding expert site in service bean type. An external service(230) generates a mobile web application server system, and performs an OSS contents connection. An external system(300) supplies study and contents services according to various information supplied from the external service(230). A site supplying system(500) supplies the corresponding expert site.
Abstract:
PURPOSE: A system for searching an XML document and a method thereof are provided to search contents and a structure with respect to a user's query integrally from indexed information by integrally indexing contents and a structure with respect to an XML document. CONSTITUTION: A DTD(Document Type Definition) reduction unit(200) is provided for reducing a complicated DTD to a simple DTD for being used in an index and a search and making an index config file. An index unit(210) is provided for receiving the config file and an XML document made at the DTD reduction unit(200) for an index. An index information storing unit(230) is provided for receiving and storing index information from the index unit(210). A search unit is provided for receiving and searching a general query and a structure query from a user. An index document converting unit(211) receives the XML document and the config file, performs a parsing of the XML document, and makes a file for an index. A morpheme interpreting unit(212) is provided for interpreting a morpheme with respect to an index file made by the index document converting unit(211). An index language extracting unit(213) for extracting an index language in a result of the morpheme interpreting unit(212). An element and position information extracting unit(214) is provided for extracting element information and position information of the index language extracted in the index language extracting unit(213).