Context based position estimation of target of interest in videos

Invention Grant

US10762662B2 Context based position estimation of target of interest in videos 有权

Please log in to see more content

Patent Title: Context based position estimation of target of interest in videos
Application No.: US16299365

Application Date: 2019-03-12
Publication No.: US10762662B2

Publication Date: 2020-09-01
Inventor: Srinivasa Rao Chalamala , Balakrishna Gudla , Krishna Rao Kakkirala
Applicant: Tata Consultancy Services Limited
Applicant Address: IN Mumbai
Assignee: Tata Consultancy Services Limited
Current Assignee: Tata Consultancy Services Limited
Current Assignee Address: IN Mumbai
Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@51f288ee
Main IPC: G06K9/00
IPC: G06K9/00 ; G06T7/73 ; G06K9/62 ; G06T7/20

Context based position estimation of target of interest in videos

Abstract:

Target tracking in a video is a highly challenging problem as the target may be effected by its appearance changes along the video, partial occlusions, background clutter, illumination variations, surrounding environment and also due to changes in the motion of the target. Embodiments of the present disclosure address this problem by implementing neural network for convolution feature maps and their gradient maps generation. The proposed two-class neural network (TCNN) is guided by feeding it target of interest defined by a bounding box in a first frame of the video. With this target guidance TCNN generates target activation map by using convolutional features and gradient maps. Target activation map gives tentative location of target, and this is further exploited to locate target precisely by using correlation filter(s) and peak location estimator based on identified context. This process repeats for every frame of the video to track the target accurately.

Public/Granted literature

US20190287264A1 CONTEXT BASED POSITION ESTIMATION OF TARGET OF INTEREST IN VIDEOS Public/Granted day:2019-09-19

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )