-
公开(公告)号:US20250148674A1
公开(公告)日:2025-05-08
申请号:US18982758
申请日:2024-12-16
Applicant: Snap Inc.
Inventor: Sergey Smetanin , Arnab Ghosh , Pavel Savchenkov , Jian Ren , Sergey Tulyakov , Ivan Babanin , Timur Zakirov , Roman Golobokov , Aleksandr Zakharov , Dor Ayalon , Nikita Demidov , Vladimir Gordienko , Daniel Moreno , Nikita Belosludtcev , Sofya Savinova
IPC: G06T11/60 , G06F3/0482
Abstract: Examples in the present disclosure relate to prompt-driven, conversation-specific image generation. A first user device of a first user of an interaction application transmits an image generation request. The interaction application enables the first user to interact with at least a second user. An image is automatically generated based on a prompt included in the image generation request, and the image is presented at the first user device. In response to receiving user input to select the image, the image is stored as a conversation-specific image in association with a first user profile of the first user and a second user profile of the second user. The conversation-specific image is presented at both the first user device of the first user and a second user device of the second user, together with conversation data associated with a conversation between the first user and the second user.
-
公开(公告)号:US20240394843A1
公开(公告)日:2024-11-28
申请号:US18434411
申请日:2024-02-06
Applicant: Snap Inc.
Inventor: Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang
Abstract: Described is a system for improving machine learning models by accessing a first latent diffusion machine learning model, the first latent diffusion machine learning model trained to perform a first number of denoising steps, accessing a second latent diffusion machine learning model that was derived from the first latent diffusion machine learning model, the second latent diffusion machine learning model trained to perform a second number of denoising steps, generating noise data, processing the noise data via the first latent diffusion machine learning model to generate one or more first images, processing the noise data via the second latent diffusion machine learning model to generate one or more second images, and modify a parameter of the second latent diffusion machine learning model based on a comparison of the one or more first images with the one or more second images.
-
公开(公告)号:US12154303B2
公开(公告)日:2024-11-26
申请号:US18238979
申请日:2023-08-28
Applicant: Snap Inc.
Inventor: Jian Ren , Menglei Chai , Sergey Tulyakov , Qing Jin
Abstract: System and methods for compressing image-to-image models. Generative Adversarial Networks (GANs) have achieved success in generating high-fidelity images. An image compression system and method adds a novel variant to class-dependent parameters (CLADE), referred to as CLADE-Avg, which recovers the image quality without introducing extra computational cost. An extra layer of average smoothing is performed between the parameter and normalization layers. Compared to CLADE, this image compression system and method smooths abrupt boundaries, and introduces more possible values for the scaling and shift. In addition, the kernel size for the average smoothing can be selected as a hyperparameter, such as a 3×3 kernel size. This method does not introduce extra multiplications but only addition, and thus does not introduce much computational overhead, as the division can be absorbed into the parameters after training.
-
14.
公开(公告)号:US12008811B2
公开(公告)日:2024-06-11
申请号:US17550852
申请日:2021-12-14
Applicant: Snap Inc.
Inventor: Kavya Venkata Kota Kopparapu , Benjamin Dodson , Francesc Xavier Drudis Rius , Angus Kong , Richard Leider , Jian Ren , Sergey Tulyakov , Jiayao Yu
Abstract: Aspects of the present disclosure involve a system comprising a medium storing a program and method for machine-learning based selection of a representative video frame. The program and method provide for receiving a set of video frames; determining a first subset of frames by removing frames outside of an image quality threshold; determining a second subset by removing frames outside of an image stillness threshold; computing feature data for each frame in the second subset; providing, for each frame in the second subset, the feature data to a machine learning model (MLM), the MLM being configured to output a score for each frame in the second subset of frames based on the feature data, the MLM having been trained with a first set of images labeled based on aesthetics, and with a second set of images labeled based on image quality; and selecting a frame based on output scores.
-
公开(公告)号:US20210407163A1
公开(公告)日:2021-12-30
申请号:US17364218
申请日:2021-06-30
Applicant: Snap Inc.
Inventor: Menglei Chai , Jian Ren , Aliaksandr Siarohin , Sergey Tulyakov , Oliver Woodford
Abstract: Systems and methods herein describe novel motion representations for animating articulated objects consisting of distinct parts. The described systems and method access source image data, identify driving image data to modify image feature data in the source image sequence data, generate, using an image transformation neural network, modified source image data comprising a plurality of modified source images depicting modified versions of the image feature data, the image transformation neural network being trained to identify, for each image in the source image data, a driving image from the driving image data, the identified driving image being implemented by the image transformation neural network to modify a corresponding source image in the source image data using motion estimation differences between the identified driving image and the corresponding source image, and stores the modified source image data.
-
公开(公告)号:US12205207B2
公开(公告)日:2025-01-21
申请号:US18176971
申请日:2023-03-01
Applicant: Snap Inc.
Inventor: Sergey Smetanin , Arnab Ghosh , Pavel Savchenkov , Jian Ren , Sergey Tulyakov , Ivan Babanin , Timur Zakirov , Roman Golobokov , Aleksandr Zakharov , Dor Ayalon , Nikita Demidov , Vladimir Gordienko , Daniel Moreno , Nikita Belosludtcev , Sofya Savinova
IPC: G06F3/0482 , G06T11/60
Abstract: Examples disclosed herein describe techniques related to automated image generation in an interaction system. An image generation request is received from a first user device associated with a first user of an interaction system. The image generation request comprises a text prompt. Responsive to receiving the image generation request, an image is automatically generated by an automated text-to-image generator, based on the text prompt. The image is caused to be presented on the first user device. An indication of user input to select the image is received from the user device. Responsive to receiving the indication of the user input to select the image, the image is associated with the first user within the interaction system, and a second user of the interaction system is enabled to be presented with the image.
-
公开(公告)号:US20240395028A1
公开(公告)日:2024-11-28
申请号:US18400677
申请日:2023-12-29
Applicant: Snap Inc.
Inventor: Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang
IPC: G06V10/82 , G06N3/0455
Abstract: Described is a system for improving machine learning models. In some cases, the system improves such models by identifying an autoencoder for a latent diffusion machine learning model, the latent diffusion machine learning model is trained to receive text as input and output an image based on the received text. The system identifies a number of channels in a decoder of the autoencoder, the decoder being configured to receive latent features as input and output images. The system further identifies a performance characteristic of the decoder and changes the node topology of the decoder based on the performance characteristic to generate an updated decoder. The system retrains the latent diffusion machine learning model using the updated decoder by inputting latent features to the updated decoder, receiving an outputted image from the updated decoder, and updating one or more weights of the decoder based on an assessment of the outputted image.
-
公开(公告)号:US20240394932A1
公开(公告)日:2024-11-28
申请号:US18400873
申请日:2023-12-29
Applicant: Snap Inc.
Inventor: Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhgritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang
Abstract: Described is a system for improving machine learning models. In some cases, the system improves such models by identifying a performance characteristic for machine learning model blocks in an iterative denoising process of a machine learning model, connecting a prior machine learning model block with a subsequent machine learning model block of the machine learning model blocks within the machine learning model based on the identified performance characteristic, identifying a prompt of a user, the prompt indicative of an intent of the user for generative images, and analyzing data corresponding to the prompt using the machine learning model to generate one or more images, the machine learning model trained to generate images based on data corresponding to prompts.
-
公开(公告)号:US20240297957A1
公开(公告)日:2024-09-05
申请号:US18115997
申请日:2023-03-01
Applicant: Snap Inc.
Inventor: Mykyta Bakunov , Arnab Ghosh , Pavel Savchenkov , Sergey Smetanin , Jian Ren
CPC classification number: H04N7/0122 , G06T3/40 , G06T7/70 , G06T9/00 , G06V10/25 , G06T2207/20084 , G06T2207/20132 , G06V2201/07
Abstract: Examples disclosed herein describe aspect ratio conversion techniques for automated image generation. An image generation request comprising a prompt is received from a user device. A processor-implemented automated image generator may generate a first image based on the prompt. The first image has a first aspect ratio. According to some examples, a region of interest is determined in the first image, based on a prompt alignment indicator for the region of interest. The first image is then processed to obtain a second image. The processing includes an automatic cropping operation directed at the region of interest. The second image has a second aspect ratio that is different from the first aspect ratio. The second image is caused to be presented on the user device.
-
公开(公告)号:US12056792B2
公开(公告)日:2024-08-06
申请号:US17557834
申请日:2021-12-21
Applicant: Snap Inc.
Inventor: Jian Ren , Menglei Chai , Oliver Woodford , Kyle Olszewski , Sergey Tulyakov
CPC classification number: G06T11/00 , G06N3/045 , G06T3/60 , G06T7/194 , G06V40/10 , G06T2207/20084 , G06T2207/30196
Abstract: Systems and methods herein describe a motion retargeting system. The motion retargeting system accesses a plurality of two-dimensional images comprising a person performing a plurality of body poses, extracts a plurality of implicit volumetric representations from the plurality of body poses, generates a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose, and based on the three-dimensional warping field, generates a two-dimensional image of an artificial person performing the target pose.
-
-
-
-
-
-
-
-
-