UAV video aesthetic quality evaluation method based on multi-modal deep learning
Abstract:
The present disclosure provides a UAV video aesthetic quality evaluation method based on multi-modal deep learning. The method establishes a UAV video aesthetic evaluation data set, analyzes the UAV video with a multi-modal neural network, extracts high-dimensional features, and concatenates the extracted features to evaluate the aesthetic quality of the UAV video. The method comprises four steps: step one, establish a UAV video aesthetic evaluation data set, divided into positive samples and negative samples according to video shooting quality; step two, use SLAM technology to recover the UAV's flight trajectory and reconstruct a sparse 3D structure of the scene; step three, extract features from the input UAV video on the image branch, motion branch, and structure branch of a multi-modal neural network; and step four, concatenate the features from the branches to obtain the final video aesthetic label and video scene type.
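As a rough illustration of steps three and four only, the sketch below assumes pre-extracted per-branch feature vectors and PyTorch; the class and head names (MultiModalAestheticNet, aesthetic_head, scene_head), branch structures, and feature dimensions are hypothetical placeholders, not the network disclosed in the patent.

# Minimal sketch of the three-branch feature fusion, assuming PyTorch and
# placeholder dimensions; all layer sizes and names are illustrative.
import torch
import torch.nn as nn


class MultiModalAestheticNet(nn.Module):
    def __init__(self, img_dim=512, motion_dim=256, struct_dim=128,
                 num_scene_types=10):
        super().__init__()
        # Each branch is stood in for by a small MLP over pre-extracted features
        # (image frames, SLAM flight trajectory, sparse 3D scene structure).
        self.image_branch = nn.Sequential(nn.Linear(img_dim, 256), nn.ReLU())
        self.motion_branch = nn.Sequential(nn.Linear(motion_dim, 128), nn.ReLU())
        self.structure_branch = nn.Sequential(nn.Linear(struct_dim, 128), nn.ReLU())
        fused_dim = 256 + 128 + 128
        # Two outputs: aesthetic label (positive/negative sample) and scene type.
        self.aesthetic_head = nn.Linear(fused_dim, 2)
        self.scene_head = nn.Linear(fused_dim, num_scene_types)

    def forward(self, img_feat, motion_feat, struct_feat):
        # Step four: concatenate the per-branch features before the two heads.
        fused = torch.cat([
            self.image_branch(img_feat),
            self.motion_branch(motion_feat),
            self.structure_branch(struct_feat),
        ], dim=-1)
        return self.aesthetic_head(fused), self.scene_head(fused)


if __name__ == "__main__":
    net = MultiModalAestheticNet()
    aesthetic_logits, scene_logits = net(
        torch.randn(4, 512), torch.randn(4, 256), torch.randn(4, 128))
    print(aesthetic_logits.shape, scene_logits.shape)  # (4, 2), (4, 10)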