Publication No.: US11361470B2
Publication Date: 2022-06-14
Application No.: US16667047
Filing Date: 2019-10-29
Applicant: SRI International
Inventor: Han-Pang Chiu, Zachary Seymour, Karan Sikka, Supun Samarasekera, Rakesh Kumar, Niluthpol Mithun
IPC: G01S17/89, G01S7/48, G06F16/583, G06K9/62, G06T7/00, G06T7/73, G06V10/44, G06V10/82, G06V20/00
Abstract: A method, apparatus, and system for visual localization include extracting appearance features of an image, extracting semantic features of the image, fusing the extracted appearance and semantic features, pooling and projecting the fused features into a semantic embedding space that has been trained using fused appearance and semantic features of images having known locations, computing a similarity measure between the projected fused features and embedded, fused appearance and semantic features of images, and predicting a location of the image associated with the projected, fused features. The image can include at least one image from a plurality of modalities, such as a Light Detection and Ranging image, a Radio Detection and Ranging image, or a 3D Computer Aided Design modeling image, together with an image from a different sensor, such as an RGB image sensor, captured from the same geo-location; the latter image is used to determine the semantic features of the multi-modal image.
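The sequence of steps named in the abstract (fuse, pool, project, compare, predict) can be illustrated with a minimal sketch. It is not the patented implementation. It assumes the appearance and semantic features have already been extracted as per-location vectors, uses plain concatenation for fusion, average pooling, and cosine similarity as the similarity measure, and stands in a hypothetical identity matrix `WEIGHTS` for the trained embedding projection. All function names, feature values, and location labels are invented for illustration.

```python
import math

def fuse(appearance, semantic):
    """Fuse by concatenating the appearance and semantic vectors at each location."""
    return [a + s for a, s in zip(appearance, semantic)]

def pool(features):
    """Average-pool the per-location vectors into one image-level vector."""
    n = len(features)
    return [sum(column) / n for column in zip(*features)]

def project(vector, weights):
    """Apply a linear projection, then L2-normalize the result."""
    out = [sum(w * x for w, x in zip(row, vector)) for row in weights]
    norm = math.sqrt(sum(v * v for v in out)) or 1.0
    return [v / norm for v in out]

def embed(appearance, semantic, weights):
    """Full chain: fuse, pool, and project into the embedding space."""
    return project(pool(fuse(appearance, semantic)), weights)

def cosine(u, v):
    """Cosine similarity of two vectors that are already unit length."""
    return sum(a * b for a, b in zip(u, v))

def localize(query_embedding, database):
    """Return (location, score) of the most similar reference embedding."""
    scored = ((loc, cosine(query_embedding, emb)) for loc, emb in database)
    return max(scored, key=lambda pair: pair[1])

# Hypothetical projection: a 4x4 identity matrix in place of trained weights.
WEIGHTS = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# Hypothetical reference images with known locations (two spatial cells each).
database = [
    ("location_A", embed([[1.0, 0.0], [1.0, 0.0]], [[1.0, 0.0], [1.0, 0.0]], WEIGHTS)),
    ("location_B", embed([[0.0, 1.0], [0.0, 1.0]], [[0.0, 1.0], [0.0, 1.0]], WEIGHTS)),
]

# Hypothetical query image whose features resemble the first reference.
query = embed([[0.9, 0.1], [1.0, 0.0]], [[1.0, 0.0], [0.8, 0.2]], WEIGHTS)

best_location, best_score = localize(query, database)
print(best_location, round(best_score, 3))
```

In this sketch the predicted location is simply that of the nearest reference embedding. A trained system would replace the identity projection with learned weights and the toy vectors with features from real appearance and semantic extractors.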