Invention Grant
- Patent Title: System and method for inserting a description of images into audio recordings
- Patent Title (中): 将图像描述插入音频记录的系统和方法
-
Application No.: US11866495Application Date: 2007-10-03
-
Publication No.: US07996227B2Publication Date: 2011-08-09
- Inventor: Peter C. Boyle , Yu Zhang
- Applicant: Peter C. Boyle , Yu Zhang
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Hoffman Warnick LLC
- Priority: CA2567505 20061109
- Main IPC: G10L11/00
- IPC: G10L11/00 ; G10L15/26 ; G06F17/27 ; G06K9/72

Abstract:
There is disclosed a system and method for interpreting and describing graphic images. In an embodiment, the method of inserting a description of an image into an audio recording includes: interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. The word description of the image can then be appended to the selected audio clip to produce an augmented audio recording including the interpreted word description of the image.
Public/Granted literature
- US20080114601A1 SYSTEM AND METHOD FOR INSERTING A DESCRIPTION OF IMAGES INTO AUDIO RECORDINGS Public/Granted day:2008-05-15
Information query