System and method for forecasting location of target in monocular first person view

Invention Grant

US11893751B2 System and method for forecasting location of target in monocular first person view 有权

Please log in to see more content

Patent Title: System and method for forecasting location of target in monocular first person view
Application No.: US17405060

Application Date: 2021-08-18
Publication No.: US11893751B2

Publication Date: 2024-02-06
Inventor: Junaid Ahmed Ansari , Brojeshwar Bhowmick
Applicant: Tata Consultancy Services Limited
Applicant Address: IN Mumbai
Assignee: Tata Consultancy Services Limited
Current Assignee: Tata Consultancy Services Limited
Current Assignee Address: IN Mumbai
Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
Priority: IN 2021038986 2020.09.09
Main IPC: G06K9/00
IPC: G06K9/00 ; G06T7/215 ; G06T7/246 ; G06N3/0442

Abstract:

This disclosure relates generally to system and method for forecasting location of target in monocular first person view. Conventional systems for location forecasting utilizes complex neural networks and hence are computationally intensive and requires high compute power. The disclosed system includes an efficient and light-weight RNN based network model for predicting motion of targets in first person monocular videos. The network model includes an auto-encoder in the encoding phase and a regularizing layer in the end helps us get better accuracy. The disclosed method relies entirely just on detection bounding boxes for prediction as well as training of the network model and is still capable of transferring zero-shot on a different dataset.

Public/Granted literature

US20220076431A1 SYSTEM AND METHOD FOR FORECASTING LOCATION OF TARGET IN MONOCULAR FIRST PERSON VIEW Public/Granted day:2022-03-10

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )