Video retrieval techniques using video contrastive learning

Invention Grant

US12277171B2 Video retrieval techniques using video contrastive learning (Active)

Patent Title: Video retrieval techniques using video contrastive learning
Application No.: US18179617

Application Date: 2023-03-07
Publication No.: US12277171B2

Publication Date: 2025-04-15
Inventor: Xiao Xia Mao, Wei Jun Zheng, Shi Hui Gui, Xiao Feng Ji
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Applicant Address: US NY Armonk
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Current Assignee Address: US NY Armonk
Agent: Lily Neff
Main IPC: G06V20/70
IPC: G06V20/70; G06F16/78; G06F40/30; G06V10/74; G06V10/774; G06V10/82; G06V30/19

Abstract:

A method, computer system, and a computer program product are provided for training a neural network for finding queried videos. Two pairs of video clips and associated text are obtained from a first dataset and a second dataset. The first dataset is used to train two video encoders by providing the video clips to the encoders as input and providing the outputs to a cosine similarity calculator. The second dataset is used to train a multi-mentor paradigm with two mentors. A first mentor and a second mentor are each provided the pair of textual data inputs. The first mentor provides a similarity value comparison, and the second mentor provides a word mover distance. Using the output from the multi-mentor paradigm and the encoders, a contrastive loss is calculated and used to provide contrastive learning of video features by differentiating similarity and dissimilarity of the video clips.
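
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not state how the two mentor outputs are combined or what form the contrastive loss takes, so the equal-weight blend in `mentor_target`, the `1 / (1 + distance)` conversion of the word mover distance, the margin-based loss, and all function names here are assumptions chosen for illustration. The mentor outputs (a text similarity value and a word mover distance) are taken as plain numbers rather than computed.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity between two 1-D feature vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def mentor_target(text_similarity, word_mover_distance):
    """Blend the two mentor outputs into one soft label in [0, 1].

    Assumption: equal weighting, with 1 / (1 + distance) used to turn
    the word mover distance into a similarity. The abstract does not
    specify how the mentor outputs are combined.
    """
    sim_from_distance = 1.0 / (1.0 + word_mover_distance)
    clipped = min(max(text_similarity, 0.0), 1.0)
    return 0.5 * clipped + 0.5 * sim_from_distance


def contrastive_loss(video_similarity, target, margin=0.5):
    """Margin-based contrastive loss with a soft label (assumed form).

    A high target pulls the two video embeddings together; a low
    target pushes their similarity below the margin.
    """
    pull = target * (1.0 - video_similarity) ** 2
    push = (1.0 - target) * max(0.0, video_similarity - margin) ** 2
    return pull + push


# Hypothetical encoder outputs for two video clips.
clip_a = [0.2, 0.9, 0.1]
clip_b = [0.3, 0.8, 0.0]

video_sim = cosine_similarity(clip_a, clip_b)
# Hypothetical mentor outputs for the two associated texts.
target = mentor_target(text_similarity=0.85, word_mover_distance=0.4)
loss = contrastive_loss(video_sim, target)
```

In a real training loop the loss would be averaged over a batch and backpropagated through both video encoders; this sketch only shows how the quantities named in the abstract could fit together for a single pair.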

Public/Granted literature

US20240303272A1 VIDEO RETRIEVAL TECHNIQUES USING VIDEO CONTRASTIVE LEARNING Public/Granted day: 2024-09-12

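IPC classification:

G	Physics
G06	Computing; calculating or counting
G06V	Image or video recognition or understanding
G06V20/00	Scenes; scene-specific elements (control of digital cameras H04N5/232)
G06V20/70	. Labelling scene content, e.g. deriving syntactic or semantic representations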