Abstract:
The present invention discloses a method for identifying an identity, device and a communication terminal. The method includes that: a voiceprint feature of a current call object and a mobile phone number of the current call object are extracted; the identity of the current call object is identified according to the voiceprint feature and the mobile phone number. The present invention solves the problem in the related art that it is difficult to effectively identify the identity of a call object, thus providing a method for effectively identifying the identity of a current call object and technically reducing the probability of phone fraud on a user.
Abstract:
A voice recognition server 200 has a voice reception unit 202 which receives a voice from a telephone equipment 100, a model storage unit 208 which stores at least one acoustic model and at least one language model used for converting the voice received by the voice reception unit 202, to character data, a number decision unit 204 which decides a current calling number and a second number of the telephone equipment 100, a model selection unit 206 which selects an acoustic model stored in the model storage unit 208, based on the current calling number and the second number, and which selects a language model stored in the model storage unit 208, based on the current calling number, and a voice recognition unit 210 which converts the voice received by the voice reception unit 202, to character data, based on the acoustic model and the language model selected by the model selection unit 206.
Abstract:
An advanced telecommunications system is provided for the recognizing of spoken commands over a cellular telephone (15), satellite telephone (14), or personal communications network (16). In the cellular application, for example, a Speech Recognition System (20) interconnects either internally with or as an external peripheral to a cellular telecommunications switch (12). The Speech Recognition System (20) includes an administrative subsystem (21), a call processing subsystem (23), a speaker-dependent recognition subsystem (25), a speaker-independent recognition subsystem (27), and a data storage subsystem (29). Pre-recorded instructional messages are stored in the memory of the call processing subsystem (23) for instructing a user on his or her progress in using the system. The speaker-independent recognition subsystem (27) allows the user to interact with the system employing non-user specific functions. User specific functions are controlled with the speaker-dependent recognition subsystem (25). User specific attributes collected by the recognition subsystems are stored in the data storage subsystem (29).
Abstract:
Improved systems and methods are provided for transcribing audio files of voice mails sent over a unified messaging system. Customized grammars specific to a voice mail recipient are created and utilized to transcribe a received voice mail by comparing the audio file to commonly utilized words, names, acronyms, and phrases used by the recipient. Key elements are identified from the resulting text transcription to aid the recipient in processing received voice mails based on the significant content contained in the voice mail.
Abstract:
The invention provides an apparatus for automatically preparing the minutes of a conference. When the voices of statements of plural speakers are entered into voice input means 12 in a state in which the vocalization data are registered in vocalization memory means 11 for each identification data of the plural speakers, the speaker and content of statement are recognized according to the registered data. The recognized content of statements is edited, together with the identification data of the speakers, by record preparation means 14 into a predetermined form to provide minutes data, whereby the minutes data of the content of statements of the plural speakers can be automatically prepared, on real time basis, edited in a predetermined form.
Abstract:
An advanced telecommunications system is provided for the recognizing of spoken commands over a cellular telephone (15), satellite telephone (14), or personal communications network (16). In the cellular application, for example, a Speech Recognition System (20) interconnects either internally with or as an external peripheral to a cellular telecommunications switch (12). The Speech Recognition System (20) includes an administrative subsystem (21), a call processing subsystem (23), a speaker-dependent recognition subsystem (25), a speaker-independent recognition subsystem (27), and a data storage subsystem (29). Pre-recorded instructional messages are stored in the memory of the call processing subsystem (23) for instructing a user on his or her progress in using the system. The speaker-independent recognition subsystem (27) allows the user to interact with the system employing non-user specific functions. User specific functions are controlled with the speaker-dependent recognition subsystem (25). User specific attributes collected by the recognition subsystems are stored in the data storage subsystem (29).
Abstract:
A system (105) and method provides universal access to voice-based documents containing information (300, 400, 500) formatted using MIME and HTML standards using customized extensions for voice information access and navigation. These voice documents are linked using HTML hyper-links that are accessible to subscribers using voice commands, touch-tone inputs and other selection means (600, 700). These voice documents and components in them are addressable using HTML anchors embedding HTML universal resource locators (URLs) rendering them universally accessible over the Internet (101). This collection of connected documents forms a voice web. The voice web includes subscriber-specific documents including speech training files (431, 432, 433) for speaker dependent speech recognition, voice print files for authenticating the identity of a user (408) and personal preference and attribute files (531, 532, 533) for customizing other aspects of the system in accordance with a specific subscriber.
Abstract:
An advanced telecommunications system is provided for the recognizing of spoken commands over a cellular telephone (15), satellite telephone (14), or personal communications network (16). In the cellular application, for example, a Speech Recognition System (20) interconnects either internally with or as an external peripheral to a cellular telecommunications switch (12). The Speech Recognition System (20) includes an administrative subsystem (21), a call processing subsystem (23), a speaker-dependent recognition subsystem (25), a speaker-independent recognition subsystem (27), and a data storage subsystem (29). Pre-recorded instructional messages are stored in the memory of the call processing subsystem (23) for instructing a user on his or her progress in using the system. The speaker-independent recognition subsystem (27) allows the user to interact with the system employing non-user specific functions. User specific functions are controlled with the speaker-dependent recognition subsystem (25). User specific attributes collected by the recognition subsystems are stored in the data storage subsystem (29).