-
公开(公告)号:KR100792016B1
公开(公告)日:2008-01-04
申请号:KR1020060069845
申请日:2006-07-25
Applicant: 한국항공대학교산학협력단
Abstract: A device and a method for summarizing a video on the basis of a character are provided to provide a video summarization by character using audio and video information. A device for summarizing a video on the basis of a character using audio and video information comprises a speaker detecting unit(100), a face sensing unit(200), and a video summarizing unit(300). The speaker detecting unit detects a main speaker by detecting a speaker by using auditory information and provides summarization of a specific character unit. The face sensing unit senses a key frame which shows a specific character by sensing a face portion using visual information. The video summarizing unit performs a video summarizing operation based on the character by using a video summarization result based on the speaker, and a face sensing result using the visual information at the face sensing unit. A method thereof includes a step of detecting the main speaker by sensing the speaker by using the auditory information and providing the summarization of the specific character unit; a step of detecting the key frame showing the specific character by detecting the face portion using the visual information; and a step of summarizing the video based on the character by using the video summarizing result and the face sensing result.
Abstract translation: 提供用于基于字符来汇总视频的设备和方法,以通过使用音频和视频信息的字符提供视频摘要。 基于使用音频和视频信息的字符来总结视频的设备包括扬声器检测单元(100),人脸感测单元(200)和视频摘录单元(300)。 扬声器检测单元通过使用听觉信息检测扬声器来检测主扬声器,并提供特定字符单元的概括。 面部感测单元通过使用视觉信息感测脸部部分来感测显示特定角色的关键帧。 视频总结单元通过使用基于扬声器的视频摘要结果和使用面部感测单元处的视觉信息的人脸感测结果,基于该角色执行视频总结操作。 其方法包括通过使用听觉信息感测扬声器并提供特定字符单元的总结来检测主扬声器的步骤; 通过使用所述视觉信息检测所述面部部分来检测示出所述特定字符的关键帧的步骤; 以及通过使用视频摘要结果和面部感测结果来基于角色来总结视频的步骤。