Automatically predicting text in images

Invention Grant

US12159452B2 Automatically predicting text in images (Active)

Patent Title: Automatically predicting text in images
Application No.: US17309255

Application Date: 2019-11-14
Publication No.: US12159452B2

Publication Date: 2024-12-03
Inventor: Perouz Taslakian, Negin Sokhandan Asl
Applicant: SERVICENOW CANADA INC.
Applicant Address: CA Montréal
Assignee: SERVICENOW CANADA INC.
Current Assignee: SERVICENOW CANADA INC.
Current Assignee Address: CA Montréal
Agency: FASKEN MARTINEAU DUMOULIN LLP
Agent: Johann Gest
International Application: PCT/CA2019/051627 WO 20191114
International Announcement: WO2020/097734 WO 20200522
Main IPC: G06V10/82
IPC: G06V10/82 ; G06F16/538 ; G06F16/583 ; G06V20/62 ; G06V30/10 ; G06V30/14 ; G06V30/148 ; G06V30/19 ; G06V30/262

Abstract:

Systems and methods for detecting and predicting text within images. An image is passed to a feature-extraction module. Each image typically contains at least one text object, and each text object contains at least one character. Based on the image, the feature-extraction module generates at least one feature map indicating text object(s) in the image. The feature map(s) is then passed to a decoder module. In some implementations, the decoder module applies a weighted mask to the feature map(s). Based on the feature map(s), the decoder module predicts a sequence of characters in the text object(s). In some embodiments, that prediction is based on previous known data. The decoder module is directed by a query that indicates at least one desired characteristic of the text object(s). An output module then refines the predicted content. At least one neural network may be used.
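
The data flow described in the abstract (feature extraction, optional weighted mask, query-directed decoding, output refinement) can be illustrated with a toy sketch. This is not the patented method: every function name, the normalisation step, the column-wise argmax decoder, and the `max_chars` query field are assumptions made purely for illustration, and no neural network is involved.

```python
from typing import Dict, List, Sequence

FeatureMap = List[List[float]]


def extract_features(image: Sequence[Sequence[float]]) -> FeatureMap:
    """Hypothetical feature extractor: scale pixel values into [0, 1]."""
    peak = max(max(row) for row in image) or 1.0
    return [[px / peak for px in row] for row in image]


def apply_weighted_mask(fmap: FeatureMap, mask: Sequence[Sequence[float]]) -> FeatureMap:
    """Multiply each feature value by the mask weight at the same position."""
    return [[f * w for f, w in zip(frow, mrow)] for frow, mrow in zip(fmap, mask)]


def decode(fmap: FeatureMap, alphabet: str, query: Dict[str, int]) -> str:
    """Hypothetical decoder: one character per column, taken from the row
    with the strongest activation. The query here only limits how many
    characters are predicted (an assumed 'desired characteristic')."""
    steps = min(query["max_chars"], len(fmap[0]))
    chars = []
    for col in range(steps):
        column = [row[col] for row in fmap]
        best_row = max(range(len(column)), key=column.__getitem__)
        chars.append(alphabet[best_row])
    return "".join(chars)


def refine(text: str) -> str:
    """Hypothetical output module: trim surrounding whitespace."""
    return text.strip()


def predict_text(image, mask, alphabet: str, query: Dict[str, int]) -> str:
    """Run the full toy pipeline: extract, mask, decode, refine."""
    fmap = apply_weighted_mask(extract_features(image), mask)
    return refine(decode(fmap, alphabet, query))


# Rows correspond to the symbols of the alphabet, columns to character positions.
image = [
    [0, 9, 0, 1],
    [8, 1, 0, 0],
    [0, 0, 7, 0],
]
mask = [[1.0, 1.0, 1.0, 0.0]] * 3  # down-weight the last column entirely
result = predict_text(image, mask, alphabet="abc", query={"max_chars": 3})
```

In this sketch the query is what keeps the decoder from reading the masked-out fourth column; in a real system the query and mask would be learned or supplied by a model rather than hard-coded.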

Public/Granted literature

US20220019834A1 AUTOMATICALLY PREDICTING TEXT IN IMAGES Public/Granted day: 2022-01-20

IPC Classification:

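G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V10/00	Arrangements for image or video recognition or understanding (character recognition in images or video G06V30/10)
G06V10/70	.using pattern recognition or machine learning (optical pattern recognition or electronic computing G06V10/88)
G06V10/82	..using neural networks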