Method and system for video action classification by mixing 2D and 3D features

Invention Grant

US11276249B2 Method and system for video action classification by mixing 2D and 3D features 有权

Please log in to see more content

Patent Title: Method and system for video action classification by mixing 2D and 3D features
Application No.: US15931796

Application Date: 2020-05-14
Publication No.: US11276249B2

Publication Date: 2022-03-15
Inventor: Han Na , Rei Odaira
Applicant: International Business Machines Corporation
Applicant Address: US NY Armonk
Assignee: International Business Machines Corporation
Current Assignee: International Business Machines Corporation
Current Assignee Address: US NY Armonk
Agency: Terrile, Cannatti & Chambers, LLP
Agent Michael Rocco Cannatti
Main IPC: G06V20/00
IPC: G06V20/00 ; G06V20/40 ; G06N3/08 ; G06K9/62

Method and system for video action classification by mixing 2D and 3D features

Abstract:

A method, system, and computer program product provide for video action classification by selecting a first video frame and a first plurality of video frames from a received video to process the first video frame with a 2D convolutional neural network processing pathway to extract spatial features classifying the first video frame, and to process the first plurality of video frames with a 3D convolutional neural network processing pathway to extract spatiotemporal features classifying the first plurality of video frames so that the spatial features are combined with the spatiotemporal features to generate a classification label for the video action.

Public/Granted literature

US20210357647A1 Method and System for Video Action Classification by Mixing 2D and 3D Features Public/Granted day:2021-11-18

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V20/00	场景；特定场景元素（控制数码相机 H04N5/232）