Image captioning augmented with understanding of the surrounding text

Invention Grant

US10915572B2 Image captioning augmented with understanding of the surrounding text 有权

Please log in to see more content

Patent Title: Image captioning augmented with understanding of the surrounding text
Application No.: US16205880

Application Date: 2018-11-30
Publication No.: US10915572B2

Publication Date: 2021-02-09
Inventor: Priscilla Santos Moraes , Shunguo Yan
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agent Brian D. Welle
Main IPC: G06F16/583
IPC: G06F16/583 ; G06F16/51 ; G06F16/901

Image captioning augmented with understanding of the surrounding text

Abstract:

To augment an image caption, a caption graph containing entity nodes corresponding to entities contained in the image and relationship edges between entity nodes corresponding to relationships between entities as illustrated in the image is generated. In addition, a contextual graph containing one or more of entity nodes corresponding to entities contained in the image and described in text associated with the image, textual entity nodes corresponding to textual entities described in text associated with the image and textual relationship edges between entity node pairs, textual entity node pairs and entity node and textual entity node pairs is generated. The textual relationship edges correspond to relationships described in the text associated with the image between entity pairs, textual entity pairs or entity and textual entity pairs. From the contextual graph, an augmented caption graph containing entity nodes, relationship edges, textual entities and textual relationship edges is generated.

Public/Granted literature

US20200175063A1 IMAGE CAPTIONING AUGMENTED WITH UNDERSTANDING OF THE SURROUNDING TEXT Public/Granted day:2020-06-04

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F16/00	信息检索；数据库结构；文件系统结构
G06F16/50	.•静态图像数据
G06F16/58	..••使用元数据的特征检索,例如,不来自内容或者元数据派生的
G06F16/583	...•••使用从内容中自动派生的元数据