Relighting images and video using learned lighting and geometry
Abstract:
Novel machine learning (ML) models are introduced for image reconstruction training and inference workflows, which are able to estimate intrinsic components of single view images, including albedo, normal, and lighting components. According to some embodiments, such models may be trained on a mix of real and synthetic image datasets. For training on real datasets, both reconstruction and cross-relighting consistency terms may be imposed. The use of a cross-relighting consistency term allows for the use of multiple images of the same scene—although lit under different lighting conditions—to be used during training. At inference time, the model is able to operate on single or multiple images. According to other embodiments, adversarial training (e.g., in the form of a generative adversarial network (GAN)) may optionally be incorporated into the training workflow, e.g., in order to better refine the re-rendered images from the individual lighting and geometric components estimated by the model.
Public/Granted literature
Information query
Patent Agency Ranking
0/0