Action recognition with high-order interaction through spatial-temporal object tracking

Invention Grant

US11600067B2 Action recognition with high-order interaction through spatial-temporal object tracking 有权

Please log in to see more content

Patent Title: Action recognition with high-order interaction through spatial-temporal object tracking
Application No.: US17016260

Application Date: 2020-09-09
Publication No.: US11600067B2

Publication Date: 2023-03-07
Inventor: Farley Lai , Asim Kadav , Jie Chen
Applicant: NEC Laboratories America, Inc.
Applicant Address: US NJ Princeton
Assignee: NEC Laboratories America, Inc.
Current Assignee: NEC Laboratories America, Inc.
Current Assignee Address: US NJ Princeton
Agent Joseph Kolodka
Main IPC: G06V20/40
IPC: G06V20/40

Action recognition with high-order interaction through spatial-temporal object tracking

Abstract:

Aspects of the present disclosure describe systems, methods, and structures that provide action recognition with high-order interaction with spatio-temporal object tracking. Image and object features are organized into into tracks, which advantageously facilitates many possible learnable embeddings and intra/inter-track interaction(s). Operationally, our systems, method, and structures according to the present disclosure employ an efficient high-order interaction model to learn embeddings and intra/inter object track interaction across the space and time for AR. Each frame is detected by an object detector to locate visual objects. Those objects are linked through time to form object tracks. The object tracks are then organized and combined with the embeddings as the input to our model. The model is trained to generate representative embeddings and discriminative video features through high-order interaction which is formulated as an efficient matrix operation without iterative processing delay.

Public/Granted literature

US20210081673A1 ACTION RECOGNITION WITH HIGH-ORDER INTERACTION THROUGH SPATIAL-TEMPORAL OBJECT TRACKING Public/Granted day:2021-03-18

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V20/00	场景；特定场景元素（控制数码相机 H04N5/232）
G06V20/40	.在视频内容中（提取叠加文本 G06V20/62）（视频检索 G06F16/70）（在视频服务器中处理视频基本流H04N21/234）（在视频客户端中处理视频基本流H04N21/44）