Abstract:
A standard filler, or garbage model, is calculated for the detection of out of vocabulary utterances. Gobal statistical parameters are calculated (116) based upon the statistical parameters for new training data and a garbage model (118) is updated based upon the global statistical parameters. This is carried out on-line while the user is enrolling the vocabulary. The garbage model is preferably an average speaker model, representative of all the speech data enrolled by the user to date. The garbage model is preferably obtained as a by-product of the vocabulary enrollment procedure and is similar in it characteristics and topology to all the other regular vocabulary HMMs.
Abstract:
In a statistical based speech recognition system, one of the key issues is the selection of the Hidden Markov Model that best matches a given sequence of feature observations. The problem is usually addressed by the calculation of the maximum likelihood, ML, state sequence by means of a Viterbi or other decoder. Noise or inadequate training can produce an ML sequence associated with a Hidden Markov Model other than the correct model. The method of the present invention provides improved robustness by combining the standard ML state sequence score (416) with an additional path core (418) derived from the dynamics of the ML score as a function of time. These two scores, when combined, form a hybrid metric (420) that, when used with the decoder, optimizes selection of the correct Hidden Markov Model (422).
Abstract:
The present invention provides a method of calculating, within the framework of a speaker dependent system, a standard filler, or garbage model, for the detection of out-of-vocabulary utterances. In particular, the method receives new training data in a speech recognition system (202); calculates statistical parameters for the new training data (204); calculates global statistical parameters based upon the statistical parameters for the new training data (206); and updates a garbage model based upon the global statistical parameters (208). This is carried out on-line while the user is enrolling the vocabulary. The garbage model described in this disclosure is preferably an average speaker model, representative of all the speech data enrolled by the user to date. Also, the garbage model is preferably obtained as a by-product of the vocabulary enrollment procedure and is similar in it characteristics and topology to all the other regular vocabulary HMMs.
Abstract:
A method of providing information storage by means of Automatic Speech Recognition through a communication device of a vehicle comprises establishing a voice communication between an external source and a user of the vehicle, receiving information from the external source, processing the received information using an Automatic Speech Recognition unit in the vehicle and storing the recognized speech in textual form for future retrieval or use.
Abstract:
In a statistical based speech recognition system, one of the key issues is the selection of the Hidden Markov Model that best matches a given sequence of feature observations. The problem is usually addressed by the calculation of the maximum likelihood, ML, state sequence by means of a Viterbi or other decoder. Noise or inadequate training can produce a ML sequence associated with a Hidden Markov Model other than the correct model. The method of the present invention provides improved robustness by combining the standard ML state sequence score (416) with an additional path score (418) derived from the dynamics of the ML score as a function of time. These two scores, when combined, form a hybrid metric (420) that, when used with the decoder, optimizes selection of the correct Hidden Markov Model (422).
Abstract:
In a statistical based speech recognition system, one of the key issues is the selection of the Hidden Markov Model that best matches a given sequence of feature observations. The problem is usually addressed by the calculation of the maximum likelihood, ML, state sequence by means of a Viterbi or other decoder. Noise or inadequate training can produce a ML sequence associated with a Hidden Markov Model other than the correct model. The method of the present invention provides improved robustness by combining the standard ML state sequence score (416) with an additional path score (418) derived from the dynamics of the ML score as a function of time. These two scores, when combined, form a hybrid metric (420) that, when used with the decoder, optimizes selection of the correct Hidden Markov Model (422).
Abstract:
An apparatus and method for enhancing a handheld communication device via a telematics system in a vehicle is disclosed. Audio received at the handheld device is transferred to the telematics system in the vehicle. The received audio is then analyzed to determine whether it contains speech, and if so, an audio present signal is generated and the received audio is recorded into a memory coupled to the telematics system. The user can then engage the user interface of the telematics system to replay the recorded audio. Bluetooth protocol is preferably used to establish a channel between the handheld device and the telematics system, which can occur automatically when the two are in proximity. Analysis of the received audio preferably comprises use of a voice detector as part of a speech recognition system otherwise used by the telematics system to assess spoken commands. The memory is preferably overwritten with the latest audio sent from the handheld device to the telematics system, such that engaging the telematics system for playback of the recorded audio will repeat only the last audio sent.
Abstract:
A method of providing information storage by means of Automatic Speech Recognition through a communication device of a vehicle comprises establishing a voice communication between an external source and a user of the vehicle, receiving information from the external source, processing the received information using an Automatic Speech Recognition unit in the vehicle and storing the recognized speech in textual form for future retrieval or use.
Abstract:
A method and apparatus for adapting a help menu on a user interface, utilizing an input method such as a speech recognition system, for increased efficiency. A list of menu items is presented on the user interface including an optional menu item to reinstate any previously removed menu items. A user selects an item from the menu, such as a help menu, which can then be removed from the list of menu items in accordance with predetermined criteria. The criteria can include how many times the menu item has been accessed and when. In this way, help menu items that are familiar to a user are removed to provide an abbreviated help menu which is more efficient and less frustrating to a user, particularly in a busy and distracting environment such as a vehicle.