Concept disambiguation using multimodal embeddings

Invention Grant

US12249116B2 Concept disambiguation using multimodal embeddings (Active)

Patent Title: Concept disambiguation using multimodal embeddings
Application No.: US17656147

Application Date: 2022-03-23
Publication No.: US12249116B2

Publication Date: 2025-03-11
Inventor: Venkata Naveen Kumar Yadav Marri, Ajinkya Gorakhnath Kale
Applicant: ADOBE INC.
Applicant Address: US CA San Jose
Assignee: ADOBE INC.
Current Assignee: ADOBE INC.
Current Assignee Address: US CA San Jose
Agency: F. CHAU & ASSOCIATES, LLC
Main IPC: G06V10/771
IPC: G06V10/771; G06N3/088; G06V10/74; G06V10/77; G06V10/774; G06V10/82

Abstract:

Systems and methods for image processing are described. Embodiments of the present disclosure identify a plurality of candidate concepts in a knowledge graph (KG) that correspond to an image tag of an image; generate an image embedding of the image using a multi-modal encoder; generate a concept embedding for each of the plurality of candidate concepts using the multi-modal encoder; select a matching concept from the plurality of candidate concepts based on the image embedding and the concept embedding; and generate association data between the image and the matching concept.
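
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy knowledge graph, the precomputed vectors standing in for the multi-modal encoder, and the names `KNOWLEDGE_GRAPH`, `TOY_EMBEDDINGS`, `encode`, and `disambiguate` are all hypothetical. The abstract does not name a similarity measure, so cosine similarity is assumed here.

```python
import math

# Hypothetical toy knowledge graph: image tag -> candidate concept IDs.
KNOWLEDGE_GRAPH = {
    "turkey": ["turkey_bird", "turkey_country"],
}

# Hypothetical stand-in for a multi-modal encoder: precomputed vectors
# that place images and concepts in one shared embedding space.
TOY_EMBEDDINGS = {
    "image:farm_photo": [0.9, 0.1, 0.2],
    "concept:turkey_bird": [0.8, 0.2, 0.1],
    "concept:turkey_country": [0.1, 0.9, 0.3],
}


def encode(key):
    """Look up the toy embedding (a real system would run an encoder here)."""
    return TOY_EMBEDDINGS[key]


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def disambiguate(image_id, image_tag):
    """Pick the candidate concept whose embedding is closest to the image."""
    # 1. Identify candidate concepts in the KG for the image tag.
    candidates = KNOWLEDGE_GRAPH[image_tag]
    # 2. Embed the image.
    image_vec = encode(f"image:{image_id}")
    # 3. Embed each candidate concept and score it against the image.
    scores = {
        concept: cosine_similarity(image_vec, encode(f"concept:{concept}"))
        for concept in candidates
    }
    # 4. Select the best-matching concept.
    best = max(scores, key=scores.get)
    # 5. Generate association data linking the image to that concept.
    return {"image": image_id, "tag": image_tag, "concept": best,
            "score": scores[best]}


if __name__ == "__main__":
    print(disambiguate("farm_photo", "turkey"))
```

In this toy data the ambiguous tag "turkey" resolves to the bird concept, because the image vector lies much closer to that concept's vector than to the country's.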

Published/Granted literature

US20230326178A1 CONCEPT DISAMBIGUATION USING MULTIMODAL EMBEDDINGS Publication/grant date: 2023-10-12

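IPC classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V10/00	Arrangements for image or video recognition or understanding (character recognition in images or video G06V30/10)
G06V10/70	.using pattern recognition or machine learning (optical pattern recognition or electronic computing G06V10/88)
G06V10/77	..processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA], independent component analysis [ICA], or self-organising maps [SOM]; blind source separation
G06V10/771	...feature selection, e.g. selecting representative features from a multi-dimensional feature space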