Abstract:
An apparatus and method for enhancing a handheld communication device via a telematics system in a vehicle is disclosed. Audio received at the handheld device is transferred to the telematics system in the vehicle. The received audio is then analyzed to determine whether it contains speech, and if so, an audio present signal is generated and the received audio is recorded into a memory coupled to the telematics system. The user can then engage the user interface of the telematics system to replay the recorded audio. Bluetooth protocol is preferably used to establish a channel between the handheld device and the telematics system, which can occur automatically when the two are in proximity. Analysis of the received audio preferably comprises use of a voice detector as part of a speech recognition system otherwise used by the telematics system to assess spoken commands.
Abstract:
A method of providing information storage by means of Automatic Speech Recognition through a communication device of a vehicle comprises establishing a voice communication between an external source and a user of the vehicle, receiving information from the external source, processing the received information using an Automatic Speech Recognition unit in the vehicle and storing the recognized speech in textual form for future retrieval or use.
Abstract:
A method and apparatus for adapting a help menu on a user interface, utilizing an input method such as a speech recognition system, for increased efficiency. A list of menu items is presented on the user interface including an optional menu item to reinstate any previously removed menu items. A user selects an item from the menu, such as a help menu, which can then be removed from the list of menu items in accordance with predetermined criteria. The criteria can include how many times the menu item has been accessed and when. In this way, help menu items that are familiar to a user are removed to provide an abbreviated help menu which is more efficient and less frustrating to a user, particularly in a busy and distracting environment such as a vehicle.
Abstract:
A method of providing information storage by means of Automatic Speech Recognition through a communication device of a vehicle comprises establishing a voice communication between an external source and a user of the vehicle, receiving information from the external source, processing the received information using an Automatic Speech Recognition unit in the vehicle and storing the recognized speech in textual form for future retrieval or use.
Abstract:
In a statistical based speech recognition system, one of the key issues is the selection of the Hidden Markov Model that best matches a given sequence of feature observations. The problem is usually addressed by the calculation of the maximum likelihood, ML, state sequence by means of a Viterbi or other decoder. Noise or inadequate training can produce an ML sequence associated with a Hidden Markov Model other than the correct model. The method of the present invention provides improved robustness by combining the standard ML state sequence score (416) with an additional path core (418) derived from the dynamics of the ML score as a function of time. These two scores, when combined, form a hybrid metric (420) that, when used with the decoder, optimizes selection of the correct Hidden Markov Model (422).
Abstract:
Assisting a user in dialing a telephone call using voice nametags comprises inputting a telephone number with text. A voice nametag from the text is automatically created for each telephone number using grapheme-to-phoneme conversion. Upon initiation of dialing, a spoken phrase is entered and compared to the stored voice nametags. A confidence level score of a match between the spoken phrase and the representations of the stored voice nametags against at least one threshold is determined. The stored voice nametag with the best match to the spoken phrase is selected, and feedback is provided to the user dependent upon the confidence level of the match, which can include automatically dialing the call. An audio feedback tag may be generated and stored based on the recognition result passing a confidence threshold criterion. Further steps are provided for improving the audio quality of the stored nametag based on signal-to-noise ratio.
Abstract:
A method and apparatus for adapting a help menu on a user interface, utilizing an input method such as a speech recognition system, for increased efficiency. A list of menu items is presented on the user interface including an optional menu item to reinstate any previously removed menu items. A user selects an item from the menu, such as a help menu, which can then be removed from the list of menu items in accordance with predetermined criteria. The criteria can include how many times the menu item has been accessed and when. In this way, help menu items that are familiar to a user are removed to provide an abbreviated help menu which is more efficient and less frustrating to a user, particularly in a busy and distracting environment such as a vehicle.
Abstract:
In a statistical based speech recognition system, one of the key issues is the selection of the Hidden Markov Model that best matches a given sequence of feature observations. The problem is usually addressed by the calculation of the maximum likelihood, ML, state sequence by means of a Viterbi or other decoder. Noise or inadequate training can produce an ML sequence associated with a Hidden Markov Model other than the correct model. The method of the present invention provides improved robustness by combining the standard ML state sequence score (416) with an additional path core (418) derived from the dynamics of the ML score as a function of time. These two scores, when combined, form a hybrid metric (420) that, when used with the decoder, optimizes selection of the correct Hidden Markov Model (422).