Abstract:
A method of speaker identification for audio content being of a format based on multiple channels is disclosed. The method comprises: extracting, from a first audio clip in the format, a plurality of spatial acoustic features across the multiple channels and location information, the first audio clip containing voices from a speaker (S201), constructing a first model for the speaker based on the spatial acoustic features and the location information, the first model indicating a characteristic of the voices from the speaker (S202), identifying whether the audio content contains voices from the speaker based on the first model (S203). Corresponding system and computer program product are also disclosed.
Abstract:
A method for displaying visual information about participants in a teleconference comprises mixing of audio signals originating from participants in the teleconference, providing an automatic identification of a participant currently speaking and displaying at least one static digital image associated with the identified participant currently speaking at least during a part of the time while this participant is speaking.
Abstract:
Ein Verfahren zur Bereitstellung von in einer Konferenz erzeugten Daten, bei dem Sprachsignale von Teilnehmern der Konferenz in einer Konferenzbrücke gemischt werden, umfasst ein Bereitstellen einer über die Dauer der Konferenz mitlaufenden Zeitbasis und ein Einrichten einer automatischen Identifikation jedes Teilnehmers, wenn dieser Teilnehmer in der Konferenz spricht. Das Verfahren umfasst weiter ein Erfassen eines Gesprächsbeitrags jedes sprechenden Teilnehmers zu einem in der Konferenz geführten Gespräch der Teilnehmer als jedem sprechenden Teilnehmer zugeordnete Sprechdauer in der Konferenz, ein Zuordnen eines Zeitstempels zu der Sprechdauer, und ein Erzeugen statistischer Daten durch eine statistische Auswertung der Sprechdauern der Teilnehmer.
Abstract:
The invention refers to a Method of informing a person of an event comprising carrying out in a computing system the steps of: receiving (10) information of an event (1); determining (11 ) a specific person which is to be notified of the event (1); performing (12) an information action with the aim of informing the specific person of the event (1 ) via a telecommunications system; receiving (15) a voice utterance of a person; verifying (16) that the identity of the person coincides with that of the specific person based on the received voice utterance using biometric voice data.
Abstract:
Performing speaker recognition on voice-mail messages for grouping or sorting of voicemails according to sender. Using the terminology of the application: a method for grouping voice messages includes extracting a voice signature from a voice message and tagging the voice message with an identification associated with the voice signature. The method also includes grouping the voice message based on the identification.
Abstract:
Embodiments of the present invention are directed generally to use of biometric identification during a call for detecting an anomaly occurring in the call, such as a change in the parties (11, 13) participating on the call. Communication between parties of a call (11, 13) is monitored and biometric identification is performed using the communication. According to one exemplary embodiment, biometric prints (105), such as voice prints (105A), face prints, etc., are obtained for parties (11, 13) that are authorized to participate on a call. The call is then monitored and biometric data (105) (e.g., audio, video, etc.) captured from communication during the call is compared with the biometric prints of the authorized parties (11, 13) to detect changes in the parties participating on the call, such as a new, unauthorized party joining the call. Thus, a call processing system (10) can detect anomalies occurring during monitored calls, such as three-way calling, a handoff of a call, etc.
Abstract:
A method for distinguishing speakers in a conference call of a plurality of participants, in which method speech frames of the conference call are received in a receiving unit, which speech frames include encoded speech parameters. At least one parameter of the received speech frames is examined in an audio codec of the receiving unit, and the speech frames are classified to belong to one of the participants, the classification being carried out according to differences in the examined at least one speech parameter. These functions may be carried out in a speaker identification block, which is applicable in various positions of a teleconferencing processing chain. Finally, a spatialization effect is created in a terminal reproducing the audio signal according to notified differences by placing the participants at distinct positions in an acoustical space of the audio signal.
Abstract:
A method and system for identifying conference participants who dial in to a telephone conference of an electronic conference that includes a web conference is provided. To identify a conference participant, a conference system displays to a user the names of those conference participants who have not yet been associated with a telephone line of the telephone conference. The conference system plays to the user the identification announcement of a conference participant who is not yet associated with a telephone line. When the user hears the identification announcement, the user recognizes the name of the conference participant and selects the name of that conference participant from the displayed names. The conference system can then associate that conference participant with the telephone line associated with the identification announcement that was played to the user.
Abstract:
Authentication of voice message recipient network addresses employs generating (102) and storing (104) a "network file" that includes "voice clips" and associated network addresses that are extracted from voice messages received across a network (10) from voice message systems (16, 18). A voice clip is the first one to three seconds of voice extracted from each received voice message. Over time, the network file will grow to contain multiple voice clips and associated network voice message addresses. When a voice message originator subsequently enters a recipient's network address (106), the originating voice message system searches (114) the network file for the network address, retrieves the associated voice clip (116), and plays it for the voice message originator to authenticate the recipient's network address. Voice authentication of a voice message originator entails encoding (134) into a "voice print file", original voice clips and associated network addresses received from positively identified voice message originators. Thereafter, when a questionable voice message is received (138), the voice message system extracts a new voice clip (142), generates a new voice print (144), and compares it with the original voice print associated with the voice message address (148). If the voice prints are substantially the same, the received voice message is annotated with an "authenticating" message (150).