Invention Grant
- Patent Title: Automatically identifying speakers in real-time through media processing with dialog understanding supported by AI techniques
- Application No.: US15967829
- Application Date: 2018-05-01
- Publication No.: US10762906B2
- Publication Date: 2020-09-01
- Inventor: Marcio Ferreira Moreno, Helon Vicente Hultmann Ayala, Daniel Salles Chevitarese, Rafael R. de Mello Brandao, Renato Fontoura de Gusmao Cerqueira
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent: Joseph Petrokaitis
- Main IPC: G10L17/22
- IPC: G10L17/22; G10L17/26; G10L15/26

Abstract:
Automatically identifying speakers in real-time through media processing with dialog understanding. A plurality of audio streams may be received, each audio stream representing the speech of a participant speaking during an online meeting. A voice characteristic of a voice corresponding to the speech of the participant in the audio stream may be determined. The plurality of audio streams may be converted into text, and natural language processing may be performed to determine the content context of the dialog. The natural language processing infers a name to associate with the voice in the audio stream based on the determined content context. A data structure linking the name with the voice may be created and stored in a knowledge base. A user interface associated with the online meeting application is triggered to present the name or identity of the speaker.
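
The linking step described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the voice characteristic is reduced to an opaque `voice_id` string, the natural language processing is replaced by a single self-introduction pattern (`INTRO_PATTERN`), and the knowledge base is an in-memory dictionary (`KnowledgeBase`). Speech-to-text and the user interface are out of scope; the function simply returns the label a UI might display.

```python
import re
from dataclasses import dataclass, field
from typing import Dict, Optional

# Hypothetical stand-in for the natural language processing step: one
# self-introduction pattern. The trigger phrase is case-insensitive, the
# captured name must start with a capital letter. Not from the patent.
INTRO_PATTERN = re.compile(r"\b(?i:my name is|this is|i am|i'm)\s+([A-Z][a-z]+)")


@dataclass
class KnowledgeBase:
    """Toy store linking a voice (an opaque ID here) to an inferred name."""

    names_by_voice: Dict[str, str] = field(default_factory=dict)

    def link(self, voice_id: str, name: str) -> None:
        self.names_by_voice[voice_id] = name

    def lookup(self, voice_id: str) -> Optional[str]:
        return self.names_by_voice.get(voice_id)


def infer_name(transcript: str) -> Optional[str]:
    """Return the first name found after a self-introduction phrase, if any."""
    match = INTRO_PATTERN.search(transcript)
    return match.group(1) if match else None


def process_utterance(kb: KnowledgeBase, voice_id: str, transcript: str) -> str:
    """Update the knowledge base from one transcribed utterance and return
    the label that a meeting UI could show for this voice."""
    name = infer_name(transcript)
    if name is not None:
        kb.link(voice_id, name)
    return kb.lookup(voice_id) or "Unknown speaker"
```

In this sketch, once a voice has been linked to a name, later utterances from the same `voice_id` resolve to that name even when they contain no introduction, which mirrors the role of the stored name-to-voice link in the abstract.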