Abstract:
PURPOSE: An apparatus for filtering a noise using a statistical model in a voice signal is provided to improve the wiener filter performance by restoring voice signal using a joint density GMM(Gaussian Mixture Model). CONSTITUTION: An apparatus for filtering a noise using a statistical model comprises a clean signal spectrum vector estimating unit(214), a post SNR estimating unit(216), a transfer function estimating unit(218), and a noise filtering unit(220). The clean signal spectrum vector estimating unit estimates spectrum vector of a clean signal using a PSD(Power Spectrum Density), a PSD estimation information of the estimated input signal, and a preset statistical model. The noise filtering unit performs noise filtering using the transfer function and fast fourier transformed frequency axis complex signal.
Abstract:
PURPOSE: A method and a server servicing IPTV broadcast and an IPTV set top apparatus are provided to search the large amount of IPTV broadcasting data by searching broadcasting data in real time through voice speech. CONSTITUTION: A voice recognition list transceiver(103) transmits and receives voice recognition list request information of an update scheduling unit(101). A voice recognition unit(105) recognizes a voice signal in an IPTV set-top box(100), and a voice search request transmitter(106) transmits a recognized string to a voice search request receiver. A voice search result receiver(107) provides the broadcasting data of a broadcasting data transmitter to a display unit(108).
Abstract:
PURPOSE: A method for separating a source signals and an apparatus thereof are provided to improve the recording, transmission and recognition performances by separating only desirable sound source signal in plural sound source environments. CONSTITUTION: Fourier transformer(10) transforms a mixed input signal(S1) into each channel frequency domain through Fourier transformation. A frequency bandwidth divider(20) constitutes a frequency cluster from the each frequency domain. A frequency domain signal divider(30) applies a blind source separation for each cluster frequency domain. A reverse Fourier transformer(40) integrates the spectrums of divided signals through reverse Fourier transformation.
Abstract:
PURPOSE: A speech improving apparatus and a speech recognition system and method are provided to improve the voice recognition performance of a voice recognition system in a movable body of small resources by performing signal decoding through a sound model database. CONSTITUTION: A speed level divider(100) measures a moving speed level of a movable body through an inputted noise signal inputted in an initial stage of voice recognition. When the speed level of the movable body is lower than a predetermined value, a first sound quality improvement unit(112) improves the sound quality of a voice signal inputted by a Wiener filter. If the speed level of the movable body exceeds a predetermined value, a second sound quality improvement unit(114) improves the sound quality of a voice signal inputted by a GMM(Gaussian Mixture Model).
Abstract:
PURPOSE: A CMS(Cepstrum Mean Subtraction) method and a device thereof are provided to accurately normalize a channel property by estimating an average CMS value of the real voice section based on the CMS average value of a mute section. CONSTITUTION: A property extractor(200) extracts the properties of a mute section before a start point, a sound section, and a mute section after a finish point. A firing unit CMS value calculator(600) calculates an actual firing unit cepstrum average about the entire sound section. A cepstrum average estimator(300) estimates the cepstrum average of the entire section based on the properties of the mute section. A property vector CMS applier(400) performs channel-normalization of the estimated average. A decoder decodes the channel-normalized MFCC property vector.
Abstract:
PURPOSE: A voice recognition method of a vehicle navigation terminal is provided to generate voice emitting isoform through a simple pattern construction using a resolute/tagged result by presenting a meaning classification system for POI name domain. CONSTITUTION: A voice recognition method of a vehicle navigation terminal is as follows. The points of interest(POI) list and POI learning data are recognized from the voice information of a voice emitting isoform input to the vehicle navigation terminal (S200). A resource is built on the POI list and the POI learning data recognized(S202). The resolution and tagging on the built resource are performed with the POI list(S204). The result resolved and tagged is created as POI database(S206). Simplex/analyzed database is built based on the POI list and the POI learning data. N-gram vocabulary is extracted from the POI learning data.
Abstract:
PURPOSE: A noise cancelling apparatus is provided to estimate a clean voice more accurately in an environment in which a dynamic noise and various noises are mixed. CONSTITUTION: A noise cancelling apparatus comprises a noise estimation module(200) which calculates the estimation value of a noise signal in the current frame of a voice signal, a Wiener filter module(202) which receives the voice signal and calculates an intermediate result by applying the intermediate Wiener filter, a database(206) in which Gaussian mixed-model data is stored, and an MMSE estimation module(204) which calculates the estimation value of a clean voice by using the Gaussian mixed-model data and intermediate result.
Abstract:
An apparatus and a method for generating a response sentence are provided to perform exact meaning analysis of a voice-recognized sentence by performing second point of sentence/substitutes extraction and second meaning analysis with respect to the voice-recognized sentence. A response sentence generating method comprises the following steps of: performing morpheme analysis of a voice-recognized sentence(200,210); extracting a first point of sentence from the sentence(220); performing first meaning analysis of the sentence based on the extracted first point of sentence(230); extracting a second point of sentence including the first point of sentence from the sentence based on the first meaning analysis result in order to further extract point of sentence which are not extracted in the above second step(240); generating a meaning analysis result of the voice-recognized sentence by performing second meaning analysis of the sentence based on the extracted second point of sentence(250); and generating a response sentence to the voice-recognized sentence based on the generated meaning analysis result(260).
Abstract:
PURPOSE: A device and a method for recognizing an object name on a Korean text are provided to recognize the object name on the Korean text through the reinforced learning using a co-training based on the HMM(Hidden Markov Model). CONSTITUTION: A morpheme parser(10) separates the input text into a list of sentences, forms a morpheme list separating each sentence into the morpheme unit tagging a state label, generates/stores an HMM data structure in a memory. A statistics information extractor(20) extracts the HMM object name statistics information from an object name tagged text set. A co-training learning device(40) extends the statistics information through the current learning data by extracting the HMM object name statistics information from an unlabeled text set as advancing the co-training learning based on the HMM statistics information extracted from the object name tagged text set. An object name recognizer(50) recognizes the object name by deciding an optimal HMM object name statistics information path of the morphemes forming the sentences of the input text through a viterbi algorithm.