Semantically-aware image-based visual localization

    公开(公告)号:US11361470B2

    公开(公告)日:2022-06-14

    申请号:US16667047

    申请日:2019-10-29

    Abstract: A method, apparatus and system for visual localization includes extracting appearance features of an image, extracting semantic features of the image, fusing the extracted appearance features and semantic features, pooling and projecting the fused features into a semantic embedding space having been trained using fused appearance and semantic features of images having known locations, computing a similarity measure between the projected fused features and embedded, fused appearance and semantic features of images, and predicting a location of the image associated with the projected, fused features. An image can include at least one image from a plurality of modalities such as a Light Detection and Ranging image, a Radio Detection and Ranging image, or a 3D Computer Aided Design modeling image, and an image from a different sensor, such as an RGB image sensor, captured from a same geo-location, which is used to determine the semantic features of the multi-modal image.

    Centimeter human skeleton pose estimation

    公开(公告)号:US11263443B2

    公开(公告)日:2022-03-01

    申请号:US16868870

    申请日:2020-05-07

    Abstract: A method, apparatus and system for human skeleton pose estimation includes synchronously capturing images of a human moving through an area from a plurality of different points of view, for each of the plurality of captured images, determining a bounding box that bounds the human in the captured image and identifying pixel locations of the bounding box in the image, for each of the plurality of captured images, determining 2D and single-view 3D skeletons from the pixel locations of the bounding box, determining a first, multi-view 3D skeleton using a combination of the 2D and single-view 3D skeletons, and optimizing the first, multi-view 3D skeleton to determine a final 3D skeleton pose for the human. The method, apparatus and system can further include illuminating the area with structured light during the capturing of the images of the human moving through the area.

    CENTIMETER HUMAN SKELETON POSE ESTIMATION

    公开(公告)号:US20210019507A1

    公开(公告)日:2021-01-21

    申请号:US16868870

    申请日:2020-05-07

    Abstract: A method, apparatus and system for human skeleton pose estimation includes synchronously capturing images of a human moving through an area from a plurality of different points of view, for each of the plurality of captured images, determining a bounding box that bounds the human in the captured image and identifying pixel locations of the bounding box in the image, for each of the plurality of captured images, determining 2D and single-view 3D skeletons from the pixel locations of the bounding box, determining a first, multi-view 3D skeleton using a combination of the 2D and single-view 3D skeletons, and optimizing the first, multi-view 3D skeleton to determine a final 3D skeleton pose for the human. The method, apparatus and system can further include illuminating the area with structured light during the capturing of the images of the human moving through the area.

    ADVANCED CONTROL SYSTEM WITH MULTIPLE CONTROL PARADIGMS

    公开(公告)号:US20200218253A1

    公开(公告)日:2020-07-09

    申请号:US16639216

    申请日:2018-08-17

    Abstract: A hybrid control system includes a control agent and a control engine. The control engine is configured to install a master plan to the control agent. The master plan includes a plurality of high-level tasks. The control agent is configured to operate according to the master plan to, for each high-level task of the high-level tasks, obtain one or more low-level controls and to perform the one or more low-level controls to realize the high-level task. The control agent is configured to operate according to the master plan to transition between the plurality of high-level tasks thereby causing a seamless transition between operating at least partially autonomously and operating at least partially based on input from the tele-operator, based at least on context for the control agent, to operate at least partially autonomously and at least partially based on input from the tele-operator during execution of the master plan.

    SEMANTIC VISUAL LANDMARKS FOR NAVIGATION
    40.
    发明申请

    公开(公告)号:US20190114507A1

    公开(公告)日:2019-04-18

    申请号:US16163273

    申请日:2018-10-17

    Abstract: Techniques are disclosed for improving navigation accuracy for a mobile platform. In one example, a navigation system comprises an image sensor that generates a plurality of images, each image comprising one or more features. A computation engine executing on one or more processors of the navigation system processes each image of the plurality of images to determine a semantic class of each feature of the one or more features of the image. The computation engine determines, for each feature of the one or more features of each image and based on the semantic class of the feature, whether to include the feature as a constraint in a navigation inference engine. The computation engine generates, based at least on features of the one or more features included as constraints in the navigation inference engine, navigation information. The computation engine outputs the navigation information to improve navigation accuracy for the mobile platform.

Patent Agency Ranking