Abstract:
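The present invention relates to a home entertainment robot service that uses a URC (Ubiquitous Robotic Companion) terminal (robot) to let users enjoy the varied multimedia content distributed around the home, such as music, movies, broadcasts, games, and personal media, regardless of where in the home the user is. A URC terminal and server connected by wireless communication jointly manage the multimedia devices and content attached to the home network. A robot capable of movement and voice input uses a voice-call function and a location sensor to provide the service close to the user, so there is no need to use a remote controller for voice input or to distribute microphones around the home. The invention is further characterized in that it uses coordinate information for the home to provide a service suited to the location of the user near the robot. Keywords: speech recognition, speech synthesis, URC, home network, multimedia content, multimedia device, home entertainment, information retrieval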
Abstract:
PURPOSE: An apparatus and a method for inputting text in a mobile phone using lip reading are provided to input characters by recognizing the motion of a user's mouth. CONSTITUTION: A face location tracking unit (210) detects the motion of a user through a photographing device. A lip motion feature detecting unit (220) extracts feature vectors for an area including the lip shape from the face image supplied by the face location tracking unit. A lip motion extracting unit (230) extracts, from among the extracted vectors, the feature vector associated with the movement of the lips. A lip motion decoding unit (260) converts the feature vector from the lip motion extracting unit into the corresponding character.
Abstract:
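The present invention relates to a microphone-array-based speech recognition system using blind source separation, and to a method of extracting target speech in that system. The speech recognition system separates the mixed signals input through multiple microphones by independent component analysis, extracts from the separated source signals the single target speech uttered for the purpose of speech recognition by using a Gaussian mixture model or a hidden Markov model, and automatically recognizes the desired speech from the extracted target speech, thereby securing a higher recognition rate even in the presence of noise. Keywords: microphone array, speech recognition, blind source separation, independent component analysis (ICA), Gaussian mixture model (GMM), hidden Markov model (HMM), target speech, feature vector, log-likelihood ratio (LLR).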
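The target-selection step in the abstract above can be sketched as follows. This is a minimal illustration, not the patented method: it assumes the ICA separation has already produced the source signals, uses scalar features instead of feature vectors for brevity, and scores each source by its mean log-likelihood ratio (LLR) between a speech GMM and a noise GMM. The function names and all model parameters are hypothetical.

```python
import math

def gmm_log_likelihood(x, components):
    """Log-likelihood of scalar x under a 1-D GMM.

    `components` is a list of (weight, mean, variance) tuples (assumed format).
    """
    logs = [
        math.log(w) - 0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        for w, mean, var in components
    ]
    peak = max(logs)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(v - peak) for v in logs))

def pick_target(sources, speech_gmm, noise_gmm):
    """Return (index, scores): the separated source with the highest mean LLR."""
    def mean_llr(frames):
        total = sum(
            gmm_log_likelihood(x, speech_gmm) - gmm_log_likelihood(x, noise_gmm)
            for x in frames
        )
        return total / len(frames)

    scores = [mean_llr(frames) for frames in sources]
    best = max(range(len(sources)), key=scores.__getitem__)
    return best, scores

# Hypothetical models and already-separated sources (one feature per frame).
SPEECH_GMM = [(0.5, -1.0, 1.0), (0.5, 1.0, 1.0)]
NOISE_GMM = [(1.0, 5.0, 1.0)]
SOURCES = [[4.8, 5.2, 5.1], [0.5, -0.9, 1.2]]

target_index, llr_scores = pick_target(SOURCES, SPEECH_GMM, NOISE_GMM)
print(target_index, llr_scores)
```

With these made-up parameters the second source scores a positive mean LLR and the first a negative one, so the second source is chosen as the target speech.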
Abstract:
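The present invention presents a method of decomposing and tagging compound nouns, applied to POIs (points of interest), in order to generate utterance variants for speech recognition in a car navigation terminal. Speech recognition engines embedded in small car navigation terminals generally take isolated words as their recognition target. An isolated word is the name of a specific point on the map, and users utter such names in a variety of variant forms. To generate these utterance variants, the invention presents a compound noun decomposition and tagging methodology for vocabulary in compound-noun form that is written as place names. Decomposition is based on a chart-based dynamic programming methodology, and tagging is based on maximum entropy, attaching a semantic label to each of the single words that make up a POI name. Keywords: compound noun, compound noun decomposition, tagging, POI, variant form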
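The decomposition step described above can be illustrated with a small dynamic-programming chart over character positions. This is only a sketch under stated assumptions: the lexicon, its per-word costs, and the function name `decompose` are hypothetical, and the abstract does not say how the actual method scores candidate splits. The maximum-entropy tagging step is not covered here.

```python
import math

def decompose(compound, lexicon):
    """Split `compound` into lexicon words with a DP chart over character positions.

    best[i] holds (cost, words) for the prefix compound[:i]; lower cost is better.
    Returns None when no full segmentation exists.
    """
    n = len(compound)
    best = [(math.inf, [])] * (n + 1)
    best[0] = (0.0, [])
    for end in range(1, n + 1):
        for start in range(end):
            word = compound[start:end]
            prev_cost, prev_words = best[start]
            if word in lexicon and prev_cost < math.inf:
                cost = prev_cost + lexicon[word]
                if cost < best[end][0]:
                    best[end] = (cost, prev_words + [word])
    cost, words = best[n]
    return words if cost < math.inf else None

# Hypothetical lexicon: word -> cost (for example, a negative log probability).
LEXICON = {"서울": 1.0, "시청": 1.0, "역": 1.0, "서울시": 1.5, "청": 3.0}

print(decompose("서울시청역", LEXICON))  # ['서울', '시청', '역']
```

Here the split 서울 + 시청 + 역 (cost 3.0) beats 서울시 + 청 + 역 (cost 5.5), which shows how the chart keeps only the cheapest analysis for each prefix.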
Abstract:
PURPOSE: A speaker adaptation apparatus and a method for speech recognition are provided to remarkably improve speaker adaptation performance by estimating, with high probability, the correct answer for actual speech data through an N-best recognition result screen output function. CONSTITUTION: A voice data verification unit (202) obtains measurement data for each phoneme from accumulated data through reliability evaluation. The accumulated data includes voice data and N-best recognition result data. A sound model speaker adapting unit (204) performs speaker adaptation using the measurement data acquired for each phoneme. A sound model updating unit (206) replaces the sound model with the new speaker-dependent sound model produced by the speaker adaptation.
Abstract:
PURPOSE: A method for environment adaptation using discriminative training based on channel estimation is provided to achieve effective environmental adaptation by first finding the channel characteristics of the adaptation data while maintaining discriminability, performing model conversion, and combining the converted model with a discriminative learning technique. CONSTITUTION: A noise removing unit (110) eliminates noise components from training data (101). A base recognition performing unit (130) recognizes adaptation data (103). A channel characteristic estimator obtains a phoneme-level statistical model from the correct-answer data (104) of the adaptation data and combines the statistical model with a base sound model (102). A discriminative environment adapting unit (150) outputs an adapted sound model (106) after modifying the statistical model by applying the discriminative learning technique.
Abstract:
PURPOSE: A remote controller and a method and an apparatus for controlling an input interface are provided to enable a user to conveniently input Hangul, English, numeric, and symbol characters through a keypad. CONSTITUTION: An input keypad (1100) combines two keys from among a number key, an asterisk key, a sharp key, a directional key, and a special character key, and selects one input type from among Hangul, English, numeric, and symbol characters. A control unit (1200) recognizes a key operation made on the input keypad and processes the key signal corresponding to the recognized key operation. A wireless transmission unit (1400) transmits the key signal processed by the control unit.
Abstract:
PURPOSE: A rejection apparatus and a method based on garbage and anti-word models for speech recognition are provided to effectively reject various operating noises and unregistered words by applying a rejection process to a recognized word. CONSTITUTION: An extracting unit (104) extracts a feature vector from a voice signal. A searcher (110) scores the feature vector through pattern matching and outputs a recognition result. A rejection network generator (114) generates a rejection network for rejection evaluation from the recognition result. A rejection searcher (124) outputs, based on a garbage sound model, a recognition score for the word models that make up the rejection network. A decision logic unit (128) decides whether to reject the recognized word by comparing the recognition scores.
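The final comparison performed by the decision logic unit can be sketched roughly as below. The abstract only states that the recognition scores are compared; the per-frame log-likelihood difference, the threshold value, and the function name `should_reject` are assumptions made for this illustration.

```python
def should_reject(word_score, garbage_score, num_frames, threshold=0.5):
    """Return True when the recognized word should be rejected.

    word_score:    total log-likelihood from the main search (assumed scale).
    garbage_score: total log-likelihood from the rejection network search.
    The difference is normalized per frame so that utterance length does not
    dominate the decision; the threshold is a hypothetical tuning value.
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    per_frame_margin = (word_score - garbage_score) / num_frames
    return per_frame_margin < threshold

# A word that clearly beats the garbage model is accepted.
print(should_reject(word_score=-450.0, garbage_score=-500.0, num_frames=50))
# A word that barely beats it is rejected as likely noise or an unregistered word.
print(should_reject(word_score=-495.0, garbage_score=-500.0, num_frames=50))
```

In practice the threshold would be tuned on held-out data to balance false rejections against false acceptances.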
Abstract:
PURPOSE: A home network service method using a ubiquitous intelligent robot is provided to offer a service suited to the location of a user and of the robot by using coordinate information, and to remove the need for a remote controller by supplying a robot that accepts voice input and uses a location sensor. CONSTITUTION: User interface information is input through a ubiquitous intelligent robot. The input user interface information is transmitted to a ubiquitous intelligent robot server (S300, S302). The ubiquitous intelligent robot server searches a home network device group for a multimedia device holding the multimedia information that corresponds to the user interface information (S304). If the multimedia device is found, the information search result is output as user interface information through the ubiquitous intelligent robot.
Abstract:
PURPOSE: A multiple recognition candidate generation apparatus and a method thereof are provided, which improve the efficiency of a speech recognition engine by reducing the memory usage and search time needed to create multiple recognition candidates. CONSTITUTION: A voice feature extractor (502) creates a feature vector from speech consisting of consecutive digits. A search unit (504) creates a single recognition candidate string through pattern recognition on the feature vector, and outputs the likelihood score and feature vector for each individual digit sound in the single recognition candidate string. A multiple recognition candidate generation part (508) creates multiple recognition candidates by referring to the per-digit ordering from a confidence measure generator (506) and a preset confusion matrix.
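One way to picture the candidate generation described above is to start from the single best digit string, visit positions from least to most confident, and substitute the digits most often confused with the recognized one. This is a hedged sketch only: the function name, the confusion-matrix layout, and the substitution strategy are assumptions, not details taken from the patent.

```python
def generate_candidates(best_digits, confidences, confusion, max_candidates=3):
    """Build alternative digit strings from a single-best recognition result.

    best_digits: recognized digit string, e.g. "153".
    confidences: one confidence value per digit (higher means more reliable).
    confusion:   digit -> {confusable digit: confusion probability} (assumed layout).
    """
    candidates = [best_digits]
    # Least confident positions are the most likely recognition errors.
    order = sorted(range(len(best_digits)), key=lambda i: confidences[i])
    for pos in order:
        digit = best_digits[pos]
        alternatives = sorted(
            confusion.get(digit, {}).items(), key=lambda kv: kv[1], reverse=True
        )
        for alt, _prob in alternatives:
            if len(candidates) >= max_candidates:
                return candidates
            candidates.append(best_digits[:pos] + alt + best_digits[pos + 1:])
    return candidates

# Hypothetical confusion matrix and confidence values.
CONFUSION = {"1": {"7": 0.20, "2": 0.05}, "5": {"9": 0.10}}
print(generate_candidates("153", [0.9, 0.4, 0.8], CONFUSION))  # ['153', '193', '753']
```

Because the alternatives are derived from one search pass plus a lookup table, no N-best search lattice has to be kept in memory, which matches the stated goal of reducing memory usage and search time.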