Systems and methods for jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator

Invention Grant

US11948310B2 Systems and methods for jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator 有权

Please log in to see more content

Patent Title: Systems and methods for jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator
Application No.: US17489237

Application Date: 2021-09-29
Publication No.: US11948310B2

Publication Date: 2024-04-02
Inventor: Vitor Guizilini , Rares A. Ambrus , Kuan-Hui Lee , Adrien David Gaidon
Applicant: Toyota Research Institute, Inc.
Applicant Address: US CA Los Altos
Assignee: Toyota Research Institute, Inc.
Current Assignee: Toyota Research Institute, Inc.
Current Assignee Address: US CA Los Altos
Agency: Darrow Mustafa PC
Agent Christopher G. Darrow
Main IPC: G06K9/00
IPC: G06K9/00 ; G05D1/00 ; G06N3/045 ; G06N3/08 ; G06T7/246 ; G06T7/50 ; G06T7/55 ; G06T7/73

Systems and methods for jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator

Abstract:

Systems and methods described herein relate to jointly training a machine-learning-based monocular optical flow, depth, and scene flow estimator. One embodiment processes a pair of temporally adjacent monocular image frames using a first neural network structure to produce a first optical flow estimate; processes the pair of temporally adjacent monocular image frames using a second neural network structure to produce an estimated depth map and an estimated scene flow; processes the estimated depth map and the estimated scene flow using the second neural network structure to produce a second optical flow estimate; and imposes a consistency loss between the first optical flow estimate and the second optical flow estimate that minimizes a difference between the first optical flow estimate and the second optical flow estimate to improve performance of the first neural network structure in estimating optical flow and the second neural network structure in estimating depth and scene flow.

Public/Granted literature

US20220392083A1 SYSTEMS AND METHODS FOR JOINTLY TRAINING A MACHINE-LEARNING-BASED MONOCULAR OPTICAL FLOW, DEPTH, AND SCENE FLOW ESTIMATOR Public/Granted day:2022-12-08

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )