Invention Grant
- Patent Title: System and method for enhancing machine learning model for audio/video understanding using gated multi-level attention and temporal adversarial training
-
Application No.: US17387889Application Date: 2021-07-28
-
Publication No.: US11989939B2Publication Date: 2024-05-21
- Inventor: Saurabh Sahu , Palash Goyal
- Applicant: Samsung Electronics Co., Ltd.
- Applicant Address: KR Suwon-si
- Assignee: Samsung Electronics Co., Ltd.
- Current Assignee: Samsung Electronics Co., Ltd.
- Current Assignee Address: KR Suwon-si
- Main IPC: G06V20/40
- IPC: G06V20/40 ; G06F18/214

Abstract:
A method includes obtaining, using at least one processor, audio/video content. The method also includes processing, using the at least one processor, the audio/video content with a trained attention-based machine learning model to classify the audio/video content. Processing the audio/video content includes, using the trained attention-based machine learning model, generating a global representation of the audio/video content based on the audio/video content, generating a local representation of the audio/video content based on different portions of the audio/video content, and combining the global representation of the audio/video content and the local representation of the audio/video content to generate an output representation of the audio/video content. The audio/video content is classified based on the output representation.
Public/Granted literature
Information query