-
公开(公告)号:US20230351767A1
公开(公告)日:2023-11-02
申请号:US17732421
申请日:2022-04-28
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Arjun BHARGAVA , Chao FANG , Charles Christopher OCHOA , Kun-Hsin CHEN , Kuan-Hui LEE , Vitor GUIZILINI
CPC classification number: G06V20/58 , B60W60/001 , G06V20/49 , B60W2420/42 , B60W2420/52
Abstract: A method for generating a dense light detection and ranging (LiDAR) representation by a vision system includes receiving, at a sparse depth network, one or more sparse representations of an environment. The method also includes generating a depth estimate of the environment depicted in an image captured by an image capturing sensor. The method further includes generating, via the sparse depth network, one or more sparse depth estimates based on receiving the one or more sparse representations. The method also includes fusing the depth estimate and the one or more sparse depth estimates to generate a dense depth estimate. The method further includes generating the dense LiDAR representation based on the dense depth estimate and controlling an action of the vehicle based on identifying a three-dimensional object in the dense LiDAR representation.
-
2.
公开(公告)号:US20210097266A1
公开(公告)日:2021-04-01
申请号:US16590275
申请日:2019-10-01
Applicant: TOYOTA RESEARCH INSTITUTE, INC. , THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventor: Karttikeya MANGALAM , Ehsan ADELI-MOSABBEB , Kuan-Hui LEE , Adrien GAIDON , Juan Carlos NIEBLES DUQUE
Abstract: A method for predicting spatial positions of several key points on a human body in the near future in an egocentric setting is described. The method includes generating a frame-level supervision for human poses. The method also includes suppressing noise and filling missing joints of the human body using a pose completion module. The method further includes splitting the poses into a global stream and a local stream. Furthermore, the method includes combining the global stream and the local stream to forecast future human locomotion.
-
公开(公告)号:US20220414981A1
公开(公告)日:2022-12-29
申请号:US17895603
申请日:2022-08-25
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Arjun BHARGAVA , Sudeep PILLAI , Kuan-Hui LEE , Kun-Hsin CHEN
Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.
-
公开(公告)号:US20210358296A1
公开(公告)日:2021-11-18
申请号:US16876699
申请日:2020-05-18
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Kuan-Hui LEE , Matthew T. Kliemann , Adrien David Gaidon
Abstract: Systems and methods determining velocity of an object associated with a three-dimensional (3D) scene may include: a LIDAR system generating two sets of 3D point cloud data of the scene from two consecutive point cloud sweeps; a pillar feature network encoding data of the point cloud data to extract two-dimensional (2D) bird's-eye-view embeddings for each of the point cloud data sets in the form of pseudo images, wherein the 2D bird's-eye-view embeddings for a first of the two point cloud data sets comprises pillar features for the first point cloud data set and the 2D bird's-eye-view embeddings for a second of the two point cloud data sets comprises pillar features for the second point cloud data set; and a feature pyramid network encoding the pillar features and performing a 2D optical flow estimation to estimate the velocity of the object.
-
公开(公告)号:US20210295093A1
公开(公告)日:2021-09-23
申请号:US16827252
申请日:2020-03-23
Applicant: TOYOTA RESEARCH INSTITUTE, INC. , THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventor: Boxiao PAN , Haoye CAI , De-An HUANG , Kuan-Hui LEE , Adrien David GAIDON , Ehsan ADELI-MOSABBEB , Juan Carlos NIEBLES DUQUE
IPC: G06K9/62 , G06N3/08 , G06N3/04 , G06K9/00 , G06F16/9537 , G06F16/28 , G06F16/242 , G06F9/38
Abstract: A method for scene perception using video captioning based on a spatio-temporal graph model is described. The method includes decomposing the spatio-temporal graph model of a scene in input video into a spatial graph and a temporal graph. The method also includes modeling a two branch framework having an object branch and a scene branch according to the spatial graph and the temporal graph to learn object interactions between the object branch and the scene branch. The method further includes transferring the learned object interactions from the object branch to the scene branch as privileged information. The method also includes captioning the scene by aligning language logits from the object branch and the scene branch according to the learned object interactions.
-
公开(公告)号:US20250131739A1
公开(公告)日:2025-04-24
申请号:US19002249
申请日:2024-12-26
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Kuan-Hui LEE , Charles Christopher OCHOA , Arjun BHARGAVA , Chao FANG
IPC: G06V20/58 , B60W40/04 , B60W60/00 , G06T7/246 , G06V10/764
Abstract: A method for controlling an ego vehicle in an environment includes detecting one or more changes in a position of an agent vehicle over time in accordance with capturing at least a first representation of the environment and a second representation of the environment via one or more sensors associated with the ego vehicle. The method also includes determining a velocity of the object based on detecting the one or more changes. The method further includes classifying the agent vehicle as parked based on the velocity and contextual data associated with the agent vehicle and/or the environment. The method still further includes planning a trajectory for the ego vehicle based on classifying the agent vehicle as parked. The method also includes controlling the ego vehicle to navigate along the trajectory.
-
公开(公告)号:US20230351766A1
公开(公告)日:2023-11-02
申请号:US17732393
申请日:2022-04-28
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Kuan-Hui LEE , Charles Christopher OCHOA , Arjun BHARGAVA , Chao FANG
IPC: G06V20/58 , G06T7/246 , G06V10/764 , B60W60/00 , B60W40/04
CPC classification number: G06V20/58 , G06T7/248 , G06V10/764 , B60W60/001 , B60W40/04 , G06T2207/30252 , G06T2207/10028 , G06T2207/20081 , G06V2201/08 , B60W2554/20 , B60W2520/10 , B60W2420/52
Abstract: A method controlling an ego vehicle in an environment includes determining, via a flow model of a parked vehicle recognition system, a flow between a first representation of the environment and a second representation of the environment. The method also includes determining, via a velocity model of the parked vehicle recognition system, a velocity of a vehicle in the environment based on the flow. The method further includes determining, via a parked vehicle classification model of the parked vehicle recognition system, the vehicle is parked based on the velocity of the vehicle and one or more of features associated with the vehicle and/or the environment. The method still further includes planning a trajectory of the ego vehicle based on determining the vehicle is parked.
-
公开(公告)号:US20220245387A1
公开(公告)日:2022-08-04
申请号:US17167570
申请日:2021-02-04
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Kuan-Hui LEE , Kun-Hsin CHEN , Haofeng CHEN , Arjun BHARGAVA , Sudeep PILLAI
IPC: G06K9/00 , B60W60/00 , G06N3/04 , G06F16/901
Abstract: A method for semantic keypoint detection is described. The method includes linking, using a keypoint graph neural network (KGNN), semantic keypoints of an object within a first image of a video stream into a 2D graph structure corresponding to a category of the object. The method also includes embedding descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object. The method further includes tracking the object within subsequent images of the video stream using the embedded descriptors within the semantic keypoints of the 2D graph structure corresponding to the category of the object.
-
公开(公告)号:US20220222889A1
公开(公告)日:2022-07-14
申请号:US17147049
申请日:2021-01-12
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Arjun BHARGAVA , Sudeep PILLAI , Kuan-Hui LEE , Kun-Hsin CHEN
Abstract: A method for 3D object modeling includes linking 2D semantic keypoints of an object within a video stream into a 2D structured object geometry. The method includes inputting, to a neural network, the object to generate a 2D NOCS image and a shape vector, the shape vector being mapped to a continuously traversable coordinate shape. The method includes applying a differentiable shape renderer to the SDF shape and the 2D NOCS image to render a shape of the object corresponding to a 3D object model in the continuously traversable coordinate shape space. The method includes lifting the linked, 2D semantic keypoints of the 2D structured object geometry to a 3D structured object geometry. The method includes geometrically and projectively aligning the 3D object model, the 3D structured object geometry, and the rendered shape to form a rendered object. The method includes generating 3D bounding boxes from the rendered object.
-
公开(公告)号:US20250037478A1
公开(公告)日:2025-01-30
申请号:US18917905
申请日:2024-10-16
Applicant: TOYOTA RESEARCH INSTITUTE, INC.
Inventor: Arjun BHARGAVA , Chao FANG , Charles Christopher OCHOA , Kun-Hsin CHEN , Kuan-Hui LEE , Vitor GUIZILINI
Abstract: A method for generating a dense light detection and ranging (LiDAR) representation by a vision system of a vehicle includes generating, at a depth estimation network, a depth estimate of an environment depicted in an image captured by an image capturing sensor integrated with the vehicle. The method also includes generating, via a sparse depth network, one or more sparse depth estimates of the environment, each sparse depth estimate associated with a respective sparse representation of one or more sparse representations. The method further includes generating the dense LiDAR representation based on a dense depth estimate that is generated based on the depth estimate and the one or more sparse depth estimates. The method still further includes controlling an action of the vehicle based on the dense LiDAR representation.
-
-
-
-
-
-
-
-
-