Patent search ap:("Snap Inc.") AND inv:"Jian Ren" Page 2

11.

发明申请
CONVERSATION-SPECIFIC IMAGE GENERATION BASED ON PROMPTS 有权

公开(公告)号：US20250148674A1

公开(公告)日：2025-05-08

申请号：US18982758

申请日：2024-12-16

Applicant: Snap Inc.

Inventor： Sergey Smetanin , Arnab Ghosh , Pavel Savchenkov , Jian Ren , Sergey Tulyakov , Ivan Babanin , Timur Zakirov , Roman Golobokov , Aleksandr Zakharov , Dor Ayalon , Nikita Demidov , Vladimir Gordienko , Daniel Moreno , Nikita Belosludtcev , Sofya Savinova

IPC: G06T11/60 , G06F3/0482

Abstract: Examples in the present disclosure relate to prompt-driven, conversation-specific image generation. A first user device of a first user of an interaction application transmits an image generation request. The interaction application enables the first user to interact with at least a second user. An image is automatically generated based on a prompt included in the image generation request, and the image is presented at the first user device. In response to receiving user input to select the image, the image is stored as a conversation-specific image in association with a first user profile of the first user and a second user profile of the second user. The conversation-specific image is presented at both the first user device of the first user and a second user device of the second user, together with conversation data associated with a conversation between the first user and the second user.

12.

发明申请
STEP DISTILLATION FOR LATENT DIFFUSION MODELS 有权

公开(公告)号：US20240394843A1

公开(公告)日：2024-11-28

申请号：US18434411

申请日：2024-02-06

Applicant: Snap Inc.

Inventor： Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang

IPC: G06T5/60 , G06T5/70

Abstract: Described is a system for improving machine learning models by accessing a first latent diffusion machine learning model, the first latent diffusion machine learning model trained to perform a first number of denoising steps, accessing a second latent diffusion machine learning model that was derived from the first latent diffusion machine learning model, the second latent diffusion machine learning model trained to perform a second number of denoising steps, generating noise data, processing the noise data via the first latent diffusion machine learning model to generate one or more first images, processing the noise data via the second latent diffusion machine learning model to generate one or more second images, and modify a parameter of the second latent diffusion machine learning model based on a comparison of the one or more first images with the one or more second images.

13.

发明授权
Compressing image-to-image models with average smoothing 有权

公开(公告)号：US12154303B2

公开(公告)日：2024-11-26

申请号：US18238979

申请日：2023-08-28

Applicant: Snap Inc.

Inventor： Jian Ren , Menglei Chai , Sergey Tulyakov , Qing Jin

IPC: G06T9/00 , G06N3/08

Abstract: System and methods for compressing image-to-image models. Generative Adversarial Networks (GANs) have achieved success in generating high-fidelity images. An image compression system and method adds a novel variant to class-dependent parameters (CLADE), referred to as CLADE-Avg, which recovers the image quality without introducing extra computational cost. An extra layer of average smoothing is performed between the parameter and normalization layers. Compared to CLADE, this image compression system and method smooths abrupt boundaries, and introduces more possible values for the scaling and shift. In addition, the kernel size for the average smoothing can be selected as a hyperparameter, such as a 3×3 kernel size. This method does not introduce extra multiplications but only addition, and thus does not introduce much computational overhead, as the division can be absorbed into the parameters after training.

14.

发明授权
Machine learning-based selection of a representative video frame within a messaging application 有权

公开(公告)号：US12008811B2

公开(公告)日：2024-06-11

申请号：US17550852

申请日：2021-12-14

Applicant: Snap Inc.

Inventor： Kavya Venkata Kota Kopparapu , Benjamin Dodson , Francesc Xavier Drudis Rius , Angus Kong , Richard Leider , Jian Ren , Sergey Tulyakov , Jiayao Yu

IPC: G06K9/00 , G06F16/783 , G06N20/00 , G06T5/70 , G06V10/70 , G06V20/40 , G06V20/70

CPC classification number: G06V20/46 , G06F16/785 , G06N20/00 , G06T5/70 , G06V10/70 , G06V20/70

Abstract: Aspects of the present disclosure involve a system comprising a medium storing a program and method for machine-learning based selection of a representative video frame. The program and method provide for receiving a set of video frames; determining a first subset of frames by removing frames outside of an image quality threshold; determining a second subset by removing frames outside of an image stillness threshold; computing feature data for each frame in the second subset; providing, for each frame in the second subset, the feature data to a machine learning model (MLM), the MLM being configured to output a score for each frame in the second subset of frames based on the feature data, the MLM having been trained with a first set of images labeled based on aesthetics, and with a second set of images labeled based on image quality; and selecting a frame based on output scores.

15.

发明申请
MOTION REPRESENTATIONS FOR ARTICULATED ANIMATION 有权

公开(公告)号：US20210407163A1

公开(公告)日：2021-12-30

申请号：US17364218

申请日：2021-06-30

Applicant: Snap Inc.

Inventor： Menglei Chai , Jian Ren , Aliaksandr Siarohin , Sergey Tulyakov , Oliver Woodford

IPC: G06T13/00 , G06T7/20 , G06T11/00 , G06N3/04 , G06N3/08

Abstract: Systems and methods herein describe novel motion representations for animating articulated objects consisting of distinct parts. The described systems and method access source image data, identify driving image data to modify image feature data in the source image sequence data, generate, using an image transformation neural network, modified source image data comprising a plurality of modified source images depicting modified versions of the image feature data, the image transformation neural network being trained to identify, for each image in the source image data, a driving image from the driving image data, the identified driving image being implemented by the image transformation neural network to modify a corresponding source image in the source image data using motion estimation differences between the identified driving image and the corresponding source image, and stores the modified source image data.

16.

发明授权
Automatic image generation in an interaction system 有权

公开(公告)号：US12205207B2

公开(公告)日：2025-01-21

申请号：US18176971

申请日：2023-03-01

Applicant: Snap Inc.

Inventor： Sergey Smetanin , Arnab Ghosh , Pavel Savchenkov , Jian Ren , Sergey Tulyakov , Ivan Babanin , Timur Zakirov , Roman Golobokov , Aleksandr Zakharov , Dor Ayalon , Nikita Demidov , Vladimir Gordienko , Daniel Moreno , Nikita Belosludtcev , Sofya Savinova

IPC: G06F3/0482 , G06T11/60

Abstract: Examples disclosed herein describe techniques related to automated image generation in an interaction system. An image generation request is received from a first user device associated with a first user of an interaction system. The image generation request comprises a text prompt. Responsive to receiving the image generation request, an image is automatically generated by an automated text-to-image generator, based on the text prompt. The image is caused to be presented on the first user device. An indication of user input to select the image is received from the user device. Responsive to receiving the indication of the user input to select the image, the image is associated with the first user within the interaction system, and a second user of the interaction system is enabled to be presented with the image.

17.

发明申请
LATENT DIFFUSION MODEL AUTODECODERS 有权

公开(公告)号：US20240395028A1

公开(公告)日：2024-11-28

申请号：US18400677

申请日：2023-12-29

Applicant: Snap Inc.

Inventor： Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang

IPC: G06V10/82 , G06N3/0455

Abstract: Described is a system for improving machine learning models. In some cases, the system improves such models by identifying an autoencoder for a latent diffusion machine learning model, the latent diffusion machine learning model is trained to receive text as input and output an image based on the received text. The system identifies a number of channels in a decoder of the autoencoder, the decoder being configured to receive latent features as input and output images. The system further identifies a performance characteristic of the decoder and changes the node topology of the decoder based on the performance characteristic to generate an updated decoder. The system retrains the latent diffusion machine learning model using the updated decoder by inputting latent features to the updated decoder, receiving an outputted image from the updated decoder, and updating one or more weights of the decoder based on an assessment of the outputted image.

18.

发明申请
TEXT-TO-IMAGE DIFFUSION MODEL REARCHITECTURE 有权

公开(公告)号：US20240394932A1

公开(公告)日：2024-11-28

申请号：US18400873

申请日：2023-12-29

Applicant: Snap Inc.

Inventor： Pavlo Chemerys , Colin Eles , Ju Hu , Qing Jin , Yanyu Li , Ergeta Muca , Jian Ren , Dhgritiman Sagar , Aleksei Stoliar , Sergey Tulyakov , Huan Wang

IPC: G06T11/00 , G06N20/00 , G06T5/70 , G10L15/18 , G10L15/22

Abstract: Described is a system for improving machine learning models. In some cases, the system improves such models by identifying a performance characteristic for machine learning model blocks in an iterative denoising process of a machine learning model, connecting a prior machine learning model block with a subsequent machine learning model block of the machine learning model blocks within the machine learning model based on the identified performance characteristic, identifying a prompt of a user, the prompt indicative of an intent of the user for generative images, and analyzing data corresponding to the prompt using the machine learning model to generate one or more images, the machine learning model trained to generate images based on data corresponding to prompts.

19.

发明公开
ASPECT RATIO CONVERSION FOR AUTOMATED IMAGE GENERATION 审中-公开

公开(公告)号：US20240297957A1

公开(公告)日：2024-09-05

申请号：US18115997

申请日：2023-03-01

Applicant: Snap Inc.

Inventor： Mykyta Bakunov , Arnab Ghosh , Pavel Savchenkov , Sergey Smetanin , Jian Ren

IPC: H04N7/01 , G06T3/40 , G06T7/70 , G06T9/00 , G06V10/25

CPC classification number: H04N7/0122 , G06T3/40 , G06T7/70 , G06T9/00 , G06V10/25 , G06T2207/20084 , G06T2207/20132 , G06V2201/07

Abstract: Examples disclosed herein describe aspect ratio conversion techniques for automated image generation. An image generation request comprising a prompt is received from a user device. A processor-implemented automated image generator may generate a first image based on the prompt. The first image has a first aspect ratio. According to some examples, a region of interest is determined in the first image, based on a prompt alignment indicator for the region of interest. The first image is then processed to obtain a second image. The processing includes an automatic cropping operation directed at the region of interest. The second image has a second aspect ratio that is different from the first aspect ratio. The second image is caused to be presented on the user device.

20.

发明授权
Flow-guided motion retargeting 有权

公开(公告)号：US12056792B2

公开(公告)日：2024-08-06

申请号：US17557834

申请日：2021-12-21

Applicant: Snap Inc.

Inventor： Jian Ren , Menglei Chai , Oliver Woodford , Kyle Olszewski , Sergey Tulyakov

IPC: G06T11/00 , G06N3/045 , G06T3/60 , G06T7/194 , G06V40/10

CPC classification number: G06T11/00 , G06N3/045 , G06T3/60 , G06T7/194 , G06V40/10 , G06T2207/20084 , G06T2207/30196

Abstract: Systems and methods herein describe a motion retargeting system. The motion retargeting system accesses a plurality of two-dimensional images comprising a person performing a plurality of body poses, extracts a plurality of implicit volumetric representations from the plurality of body poses, generates a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose, and based on the three-dimensional warping field, generates a two-dimensional image of an artificial person performing the target pose.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification