Determining visual overlap of images by using box embeddings

Invention Grant

US12073601B2 Determining visual overlap of images by using box embeddings 有权

Please log in to see more content

Patent Title: Determining visual overlap of images by using box embeddings
Application No.: US18486947

Application Date: 2023-10-13
Publication No.: US12073601B2

Publication Date: 2024-08-27
Inventor: Anita Rau , Guillermo Garcia-Hernando , Gabriel J. Brostow , Daniyar Turmukhambetov
Applicant: Niantic, Inc.
Applicant Address: US CA San Francisco
Assignee: NIANTIC, INC.
Current Assignee: NIANTIC, INC.
Current Assignee Address: US CA San Francisco
Agency: FENWICK & WEST LLP
The original application number of the division: US17398443 2021.08.10
Main IPC: G06V10/75
IPC: G06V10/75 ; G06F18/214 ; G06N3/088 ; G06V10/42 ; G06V10/50

Determining visual overlap of images by using box embeddings

Abstract:

An image matching system for determining visual overlaps between images by using box embeddings is described herein. The system receives two images depicting a 3D surface with different camera poses. The system inputs the images (or a crop of each image) into a machine learning model that outputs a box encoding for the first image and a box encoding for the second image. A box encoding includes parameters defining a box in an embedding space. Then the system determines an asymmetric overlap factor that measures asymmetric surface overlaps between the first image and the second image based on the box encodings. The asymmetric overlap factor includes an enclosure factor indicating how much surface from the first image is visible in the second image and a concentration factor indicating how much surface from the second image is visible in the first image.

Public/Granted literature

US20240046610A1 DETERMINING VISUAL OVERLAP OF IMAGES BY USING BOX EMBEDDINGS Public/Granted day:2024-02-08

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V10/00	图像或视频识别或理解的安排（图像或视频中的字符识别 G06V30/10）
G06V10/70	.使用模式识别或机器学习（光学模式识别或电子计算 G06V10/88）
G06V10/74	..图像或视频模式匹配；特征空间中的邻近度量
G06V10/75	...匹配过程的组织，例如图像或视频特征的同步或顺序比较；粗细方法，例如多尺度方法；使用上下文分析；字典的选择