Abstract:
A system and method for enabling a caller to obtain access to services via a telephone network by entering a spoken first character string having a plurality of digits. Preferably, the method includes the steps of prompting the caller to speak the first character string beginning with a first digit and ending with a last digit thereof, recognizing each spoken digit of the first character string using a speaker-independent voice recognition algorithm (49a, 69a), then following entry of the last digit of the first string, initially verifying the caller's identity using a voice verification algorithm (49b, 66b). After initial verification, the caller is again prompted to enter a second character string, which must also be recognized (49c, 66c) before access is effected.
Abstract:
An advanced electronic vehicle alarm system allows control of alarm functions to be accomplished using specific spoken commands. A microphone (14) converts speech into time-variant voltage levels which are amplified and sent to a analog-to-digital converter and digitized. The digitized data is then processed by a speech recognition subsystem (10). The speech recognition subsystem (10) separates extraneous speech from words and provides corresponding output signals when control words are recognized. The output signals are preferably employed by the alarm system to operate door locking and unlocking controls (26), to operate a loud audible siren (20) and/or horn (22), to operate vehicle light controls (24), to provide engine (28) cut-off control, to provide engine (28) starting control or to operate a response indicator incorporated in the main alarm processing unit (12). The response indicator provides verbal responses to confirm spoken commands. A speaker verification capability is also included in the speech recognition subsystem (10) to allow for secured operation of the vehicle alarm system.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.
Abstract:
An advanced telecommunications system is provided for the recognizing of spoken commands over a cellular telephone (15), satellite telephone (14), or personal communications network (16). In the cellular application, for example, a Speech Recognition System (20) interconnects either internally with or as an external peripheral to a cellular telecommunications switch (12). The Speech Recognition System (20) includes an administrative subsystem (21), a call processing subsystem (23), a speaker-dependent recognition subsystem (25), a speaker-independent recognition subsystem (27), and a data storage subsystem (29). Pre-recorded instructional messages are stored in the memory of the call processing subsystem (23) for instructing a user on his or her progress in using the system. The speaker-independent recognition subsystem (27) allows the user to interact with the system employing non-user specific functions. User specific functions are controlled with the speaker-dependent recognition subsystem (25). User specific attributes collected by the recognition subsystems are stored in the data storage subsystem (29).
Abstract:
A toll call service center accessible by subscribers to the service. Subscribers have their own set of identifiers uniquely associated with a plurality of called stations desired to be accessed. The center comprises a voice command system (10) having a storage (16) for storing representations of the identifiers at storage locations uniquely identified with the subscriber. Command system (10) also includes an announcing circuit and a processor (30). Processor (30) is responsive to reception of an incoming call from a subscriber for controlling the announcing circuit to request the subscriber to speak an identifier associated with one of the plurality of called stations. A voice recognizer (48) is responsive to receipt of the identifier for determining whether the identifier is valid. If the identifier is recognized as belonging to the subscriber, the processor controls an outdialer (24) to dial the telephone number of the called station associated with the identifier.
Abstract:
A method for enabling a caller to obtain access to one or more services via a telephone network by speaking a password begins by establishing at least one predetermined threshold value for a speaker verification signal. For each spoken character of the password, the method generates a set of parameters using a voice verification feature transformation. After entry and recognition of the password, the sets of parameters are used to generate a speaker verification signal for the password (step 78). Upon the occurence of a predetermined call condition, the predetermined threshold value is adjusted to increase the level of security in the verification algorithm. If the speaker verification signal for the password has a predetermined relationship with respect to the adjusted threshold value (84), the caller's identity is accepted (88). If not, the caller may be asked (90) to answer certain personnal questions before his or her identity is accepted.
Abstract:
A toll call service center of the present invention is accessible by subscribers to a toll call service. One or more of the subscribers of the service have their own set of identifiers uniquely associated with a plurality of called stations desired to be accessed by the subscriber. The center comprises a voice command system having a storage for storing representations of the identifiers at storage locations uniquely identified with the subscriber. The command system also includes an announcing circuit and a processor operative under the control of a program stored therein. The processor is responsive to reception of an incoming call from a subscriber for controlling the announcing circuit to request the subscriber to speak an identifier associated with one of the plurality of called stations. A voice recognizer is responsive to receipt of the identifier for determining whether the identifier is valid. If the identifier is recognized as belonging to the subscriber, the processor controls an outdialer to dial the telephone number of the called station associated with the identifier.
Abstract:
Voice activated dialing is described for use in a mobile telecommunications system. A voice input is received from a wireless network user. A telephone number to be dialed is determined by using speaker independent speech recognition to interpret a string of spoken digits in the voice input to determine the telephone number, or using speaker dependent speech recognition to interpret a spoken word in the voice input to determine the telephone number. A telephone call is then initiated by dialing the telephone number.
Abstract:
A method for recognizing a spoken word in the presence of interfering speech, such as a system-generated voice prompt, begins by echo cancelling the voice prompt and any detected speech signal to produce a residual signal. Portions of the residual signal that have been most recently echo-cancelled are then continuously stored in a buffer. The energy in the residual signal is also continuously processed to determine onset of the spoken word. Upon detection of word onset, the portion of the residual signal then currently in the buffer is retained, the voice prompt is terminated, and the recognizer begins realtime recognition of subsequent portions of the residual signal. Upon detection of word completion, the method retrieves the portion of the residual signal that was retained in the buffer upon detection of word onset and performs recognition of that portion. The recognized portions of the word are then reconstructed to determine the spoken word.