Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech beings by echo-cancelling the voice prompt and any detected speech signal to produce a residual signal (60). Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer (62). The energy in the residual signal is also continuously processed to determine onset of the spoken word (64). Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal (66). Upon detection of word completion (68), the method retrieves the portion of the residual that was retained in the buffer upon detection of word onset (70) and performs recognition of that portion (72).
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.