System and method for multi-modal image classification

Invention Grant

US11544510B2 System and method for multi-modal image classification 有权

Please log in to see more content

Patent Title: System and method for multi-modal image classification
Application No.: US16509062

Application Date: 2019-07-11
Publication No.: US11544510B2

Publication Date: 2023-01-03
Inventor: Yogen Chaudhari , Sean Pinkney , Prashanth Venkatraman , Ashwath Rajendran , Jay Parikh
Applicant: comScore, Inc.
Applicant Address: US VA Reston
Assignee: comScore, Inc.
Current Assignee: comScore, Inc.
Current Assignee Address: US VA Reston
Agency: Blank Rome LLP
Main IPC: G06K9/62
IPC: G06K9/62 ; G06Q30/02 ; G06N3/08 ; G06F40/10 ; G06V30/148

System and method for multi-modal image classification

Abstract:

Systems and methods for classifying images (e.g., ads) are described. An image is accessed. Optical character recognition is performed on at least a first portion of the image. Image recognition is performed via a convolutional neural network on at least a second portion of the image. At least one class for the image is automatically identified, via a fully connected neural network, based on one or more predictions, each of the one or more predictions being based on both the optical character recognition and the image recognition. Finally, the at least one class identified for the image is output.

Public/Granted literature

US20210012145A1 SYSTEM AND METHOD FOR MULTI-MODAL IMAGE CLASSIFICATION Public/Granted day:2021-01-14

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06K	图形数据读取（图像或视频识别或理解G06V）；数据的呈现；记录载体；处理记录载体
G06K9/00	识别模式的方法或装置（图形读取或将机械参数模式（例如力或存在）转换为电信号的方法或装置 G06K11/00）（图像或视频识别或理解 G06V）（语音识别 G10L15/00 )
G06K9/62	.应用电子设备进行识别的方法或装置