Training energy-based models from a single image for internal learning and inference using trained models

Invention Grant

US12223706B2 Training energy-based models from a single image for internal learning and inference using trained models 有权

Please log in to see more content

Patent Title: Training energy-based models from a single image for internal learning and inference using trained models
Application No.: US17824694

Application Date: 2022-05-25
Publication No.: US12223706B2

Publication Date: 2025-02-11
Inventor: Zilong Zheng , Jianwen Xie , Ping Li
Applicant: Baidu USA, LLC
Applicant Address: US CA Sunnyvale
Assignee: Baidu USA, LLC
Current Assignee: Baidu USA, LLC
Current Assignee Address: US CA Sunnyvale
Agency: Oppedahl Patent Law Firm LLC
Main IPC: G06V10/82
IPC: G06V10/82 ; G06V10/774 ; G06V10/84

Training energy-based models from a single image for internal learning and inference using trained models

Abstract:

Different from prior works that model the internal distribution of patches within an image implicitly with a top-down latent variable model (e.g., generator), embodiments explicitly represent the statistical distribution within a single image by using an energy-based generative framework, where a pyramid of energy functions, each parameterized by a bottom-up deep neural network, are used to capture the distributions of patches at different resolutions. Also, embodiments of a coarse-to-fine sequential training and sampling strategy are presented to train the model efficiently. Besides learning to generate random samples from white noise, embodiments can learn in parallel with a self-supervised task (e.g., recover an input image from its corrupted version), which can further improve the descriptive power of the learned model. Embodiments does not require an auxiliary model (e.g., discriminator) to assist the training, and embodiments also unify internal statistics learning and image generation in a single framework.

Public/Granted literature

US20220398836A1 TRAINING ENERGY-BASED MODELS FROM A SINGLE IMAGE FOR INTERNAL LEARNING AND INFERENCE USING TRAINED MODELS Public/Granted day:2022-12-15

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06V	图像或视频识别或理解
G06V10/00	图像或视频识别或理解的安排（图像或视频中的字符识别 G06V30/10）
G06V10/70	.使用模式识别或机器学习（光学模式识别或电子计算 G06V10/88）
G06V10/82	..使用神经网络