Systems and methods for determining actions depicted in media contents based on attention weights of media content frames

Invention Grant

US11055537B2 Systems and methods for determining actions depicted in media contents based on attention weights of media content frames 有权

Please log in to see more content

Patent Title: Systems and methods for determining actions depicted in media contents based on attention weights of media content frames
Application No.: US15202471

Application Date: 2016-07-05
Publication No.: US11055537B2

Publication Date: 2021-07-06
Inventor: Atousa Torabi , Leonid Sigal
Applicant: Disney Enterprises, Inc.
Applicant Address: US CA Burbank
Assignee: Disney Enterprises, Inc.
Current Assignee: Disney Enterprises, Inc.
Current Assignee Address: US CA Burbank
Agency: Farjami & Farjami LLP
Main IPC: G06K9/00
IPC: G06K9/00 ; G06K9/46 ; G11B27/10 ; H04L29/06 ; G06T7/62 ; G06T7/90 ; G06N3/04 ; G06N3/08

Systems and methods for determining actions depicted in media contents based on attention weights of media content frames

Abstract:

There is provided a system comprising a label database including a plurality of label, a non-transitory memory storing an executable code, and a hardware processor executing the executable code to receive a media content including a plurality of segments, each segment including a plurality of frames, extract a first plurality of features from a segment, extract a second plurality of features from each frame of the segment, determine an attention weight for each frame of the segment based on the first plurality of features extracted from the segment and the second plurality of features extracted from the segment, and determine that the segment depicts one of the plurality of labels in a label database based on the first plurality of features, the second plurality of features, and the attention weight of each frame of the plurality of frames of the segment.

Public/Granted literature

US20170308754A1 Systems and Methods for Determining Actions Depicted in Media Contents Based on Attention Weights of Media Content Frames Public/Granted day:2017-10-26

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )