Abstract:
정답 집합의 정답문을 확장하여 정답 집합의 정답문 개수가 서로 다를 때에도 번역 품질을 정확히 측정하도록 한 자동 번역기의 번역 품질 측정을 위한 정답 집합 확장 장치 및 방법이 제시된다. 제시된 자동 번역기의 번역 품질 측정을 위한 정답 집합 확장 장치는 복수의 정답 집합들, 확장 사전, 언어 모델, 확장 정답 집합을 저장하는 저장부; 저장부에 저장된 복수의 정답 집합들 및 확장 사전을 근거로 번역 원문에 대한 하나 이상의 정답문을 생성하는 레퍼런스 확장부; 및 정답문들에 대한 사용자 단말의 승인 여부를 근거로 인증에 성공한 정답문들을 이용하여 확장 정답 집합을 생성하고, 생성한 확장 정답 집합을 저장부에 저장하는 레퍼런스 확장 인증부를 포함한다.
Abstract:
The present invention relates to an apparatus for driving an agent system based on a natural language processing operation. The apparatus for driving the agent system detects next intention of a user by using information contained in a conversation content of the user stored during the operation of an automatic translation system installed in a portable terminal, and copes with the detected intention of the user. Therefore, according to the present invention, input and output sentences of the automatic translation system are utilized as context information, so that the convenience in use of the agent system can be improved.
Abstract:
The present invention relates to an apparatus and method for improving Chinese word segmentation performance, and more particularly, an apparatus and method for improving word segmentation performance by processing word segmentation errors of Chinese by automatically recognizing an accurate boundary of a word from a sentence of another language, for example, English or Korean, of a parallel corpus of which a word boundary is clear in order to reduce unregistered word errors and ambiguity errors frequently appeared in a Chinese word segmenting device. According to the present invention, a limitation that errors are confirmed from the word segmenting device by consuming lots of manpower and time can be overcome by continuously extracting the unregistered word errors and ambiguity errors, which are difficult to process at the time of word segmentation of a Chinese sentence, through the parallel corpus and storing corrected word segmentation information.
Abstract:
The present invention suggests a data tagging device capable of linking data determined to be similar by user intuition during cross-correlating, elaborating clustering of the data using the linkage, and tagging the clustered data and a method thereof. The data tagging device suggested by the present invention includes: a cluster generation unit which generates clusters by grouping raw data using a grouping technique; a relation data generation unit which generates relation data between two instances by comparing instances extracted from each cluster upon user intuition; a grouping unit which re-groups the raw data based on the relation data and repeats generation of relation data and grouping of the raw data until conditions are met; and a data tagging unit which adds tag data to representative data of each of finally-generated clusters. According to the present invention, it is possible to improve tagging efficiency by reducing time for and errors in tagging an entity. [Reference numerals] (210) Data; (220) Grouped data; (AA) Repeat; (BB) Provide user data; (CC) Select connection/disconnection of similar examples
Abstract:
According to one embodiment, an automatic learning-based artificial intelligent dialog system includes: a database in which personalized expression learning data including sentence expressions classified by personal intention and classification tags for the sentence expressions is stored; a learning device which analyzes the sentence expressions included in the personalized expression learning data at a morpheme level and learns personal profiling data expressions attached with the classification tags at a morpheme level; a language analysis unit which analyzes the currently-inputted dialog sentence at a morpheme level; an extraction unit which extracts user profiling data based on the analysis result at a morpheme level by the language analysis unit and the personal profiling data expressions; a personal history database in which the user profiling data is classified by personal preference and accumulated as personal profiling data; an intention analysis unit which determines the intention in the dialog sentence based on the analysis result at a morpheme level by the language analysis unit; and a response generation unit which determines a dialog flow of the dialog sentence based on the personal profiling data accumulated in the personal history database and generates a response sentence. [Reference numerals] (10) Database; (20) Learning unit; (30) Language analysis unit; (40) Extraction unit; (50) Database; (60) Intention analysis unit; (70) Response generation unit