Invention Grant
- Patent Title: Multimodal identification and tracking of speakers in video
- Patent Title (Chinese): 视频中的扬声器的多模式识别和跟踪
- Application No.: US12118809
- Application Date: 2008-05-12
- Publication No.: US07920761B2
- Publication Date: 2011-04-05
- Inventor: Arnon Amir, Giridharan Iyengar, Ran D. Zilca
- Applicant: Arnon Amir, Giridharan Iyengar, Ran D. Zilca
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Cantor Colburn LLP
- Agent: Ann Dougherty
- Main IPC: G06K9/54
- IPC: G06K9/54

Abstract:
A computer program product includes machine readable instructions for providing enhanced video output by: receiving footage including likeness information in a plurality of modalities; demultiplexing the plurality of modalities to provide information for each modality; comparing information from at least two of the modalities for determining a correlation in the likeness information; using the correlation, obtaining semantic information for association with the likeness; and combining the semantic information with the likeness information for providing the enhanced video output. A system for implementing the computer program product includes resources for receiving the footage.
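The steps named in the abstract (receive, demultiplex, compare, correlate, annotate, combine) can be illustrated with a minimal sketch. It is not the patented method: the data layout (one dict per time step, with an `audio` energy value and one mouth-motion value per face track), the use of Pearson correlation as the comparison, the `threshold` parameter, and every function and key name below are assumptions made purely for illustration.

```python
from statistics import mean


def demultiplex(footage):
    """Split interleaved samples into one list of values per modality."""
    streams = {}
    for sample in footage:
        for modality, value in sample.items():
            streams.setdefault(modality, []).append(value)
    return streams


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / (var_x * var_y) ** 0.5


def enhance(footage, names, threshold=0.5):
    """Label every sample with the face track that best follows the audio.

    `names` maps a track key to semantic information (here, a display name).
    The label is None when no track reaches the (hypothetical) threshold.
    """
    streams = demultiplex(footage)
    audio = streams.pop("audio")
    # Compare the audio modality against each visual track.
    scores = {track: pearson(audio, motion) for track, motion in streams.items()}
    best = max(scores, key=scores.get)
    label = names.get(best, "unknown") if scores[best] >= threshold else None
    # Combine the semantic label with the original likeness information.
    return [dict(sample, speaker=label) for sample in footage]


# Hypothetical footage: face_a moves with the audio, face_b does not.
footage = [
    {"audio": 0.1, "face_a": 0.0, "face_b": 0.9},
    {"audio": 0.8, "face_a": 0.7, "face_b": 0.1},
    {"audio": 0.2, "face_a": 0.1, "face_b": 0.8},
    {"audio": 0.9, "face_a": 0.9, "face_b": 0.0},
]
enhanced = enhance(footage, {"face_a": "Speaker A", "face_b": "Speaker B"})
```

In this toy input the `face_a` track rises and falls with the audio energy, so each output sample carries the label associated with `face_a`. A real system would work on decoded audio and video features rather than hand-written numbers.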
Public/Granted literature
- US20080247650A1: MULTIMODAL IDENTIFICATION AND TRACKING OF SPEAKERS IN VIDEO (Public/Granted day: 2008-10-09)