Abstract:
A method for enabling a caller to obtain access to one or more services via a telephone network by speaking a password begins by establishing at least one predetermined threshold value for a speaker verification signal. For each spoken character of the password, the method generates a set of parameters using a voice verification feature transformation. After entry and recognition of the password, the sets of parameters are used to generate a speaker verification signal for the password (step 78). Upon the occurence of a predetermined call condition, the predetermined threshold value is adjusted to increase the level of security in the verification algorithm. If the speaker verification signal for the password has a predetermined relationship with respect to the adjusted threshold value (84), the caller's identity is accepted (88). If not, the caller may be asked (90) to answer certain personnal questions before his or her identity is accepted.
Abstract:
A caller is prompted to enunciate (105) an alphanumeric phrase comprising a number of characters. Having reset (104) to zero a cumulative recognition distance register or its equivalent, each character is captured and analyzed (106) and a form of acoustic dissimilarity calculated (108) in relation to a previously stored and corresponding reference character of a reference alphanumeric string. The register is incremented by a quantified difference between the reference character and the captured character and then an assessment (112) is made to determine which of the reference alphanumeric strings has the lowest cumulative recognition distance implying that this was the alphanumeric phrase spoken by the caller. Robust phrase recognition is thus achieved by the exemplified methodology of Figure 6.
Abstract:
A caller is prompted to enunciate (105) an alphanumeric phrase comprising a number of characters. Having reset (104) to zero a cumulative recognition distance register or its equivalent, each character is captured and analyzed (106) and a form of acoustic dissimilarity calculated (108) in relation to a previously stored and corresponding reference character of a reference alphanumeric string. The register is incremented by a quantified difference between the reference character and the captured character and then an assessment (112) is made to determine which of the reference alphanumeric strings has the lowest cumulative recognition distance implying that this was the alphanumeric phrase spoken by the caller. Robust phrase recognition is thus achieved by the exemplified methodology of Figure 6.
Abstract:
The present invention describes a method for recognizing alphanumeric strings spoken over a telephone network (10) wherein individual character recognition need not to be uniformly high in order to achieve high string recognition accuracy. Preferably, the method uses a processing system (14) having a digital processor, an interface (42) to the telephone network (10), and a database (32) for storing a predetermined set of reference alphanumeric strings. In operation, the system (10) prompts the caller to speak the characters of a string, and characters are recognized using a speaker-independent voice recognition algorithm (48). The method calculates recognition distances between each spoken input character and the corresponding letter or digit in the same position within each reference alphanumeric string. After each character is spoken (206), captured and analyzed (208), each reference string distance is incremented (204) and the process is continued, accumulating distances for each reference string, until the last character is spoken. The reference string with the lowest cumulative distance is then declared to be the recognized string (210).