Generating semantic scene graphs from ungrounded label graphs and visual graphs for digital images
Abstract:
This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize weakly supervised graph matching to align an ungrounded label graph and a visual graph corresponding to a digital image. Specifically, the disclosed system utilizes a label embedding model to generate label graph embeddings from the ungrounded label graph and a visual embedding network to generate visual graph embeddings from the visual graph. Additionally, the disclosed system determines similarity metrics indicating the similarity of pairs of label graph embeddings and visual graph embeddings. The disclosed system then generates a semantic scene graph by utilizing a graph matching algorithm to align the ungrounded label graph and the visual graph based on the similarity metrics. In some embodiments, the disclosed system utilizes contrastive learning to modify the embedding models. Furthermore, in additional embodiments, the disclosed system utilizes the semantic scene graph to train a scene graph generation neural network.
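The core pipeline in the abstract — embed the nodes of each graph, score every label/visual pair with a similarity metric, then align the two graphs by matching — can be illustrated with a minimal sketch. Everything below is hypothetical: the embeddings are hand-written stand-ins for the outputs of the label embedding model and visual embedding network, cosine similarity is assumed as the similarity metric, and an exhaustive maximum-similarity matching stands in for the graph matching algorithm (the disclosure does not specify these choices).

```python
from itertools import permutations
from math import sqrt

# Hypothetical stand-in embeddings; in the disclosed system these would be
# produced by the label embedding model (for ungrounded label-graph nodes)
# and the visual embedding network (for visual-graph nodes).
label_embeddings = {
    "person":  [1.0, 0.0, 0.0],
    "bicycle": [0.0, 1.0, 0.0],
}
visual_embeddings = {
    "region_A": [0.1, 0.9, 0.1],
    "region_B": [0.9, 0.1, 0.0],
}

def cosine(u, v):
    """Assumed similarity metric between a label and a visual embedding."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))

def align(labels, regions):
    """Exhaustive maximum-total-similarity matching: a simple stand-in for
    the graph matching algorithm, practical only for tiny graphs."""
    label_names = list(labels)
    best, best_score = None, float("-inf")
    for perm in permutations(regions, len(label_names)):
        score = sum(cosine(labels[name], regions[r])
                    for name, r in zip(label_names, perm))
        if score > best_score:
            best, best_score = dict(zip(label_names, perm)), score
    return best

alignment = align(label_embeddings, visual_embeddings)
print(alignment)  # {'person': 'region_B', 'bicycle': 'region_A'}
```

Each aligned pair grounds a label-graph node in an image region; the grounded graph is the semantic scene graph, which the abstract notes can then serve as training data for a scene graph generation neural network.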