-
公开(公告)号:US20230153949A1
公开(公告)日:2023-05-18
申请号:US17525739
申请日:2021-11-12
Applicant: Nvidia Corporation
Inventor: Xun Huang , Zinan Lin , Ming-Yu Liu
IPC: G06T5/00
CPC classification number: G06T5/002 , G06T2207/20084 , G06T2207/20081 , G06T2207/20076
Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, on one or noise values.
-
公开(公告)号:US20250166237A1
公开(公告)日:2025-05-22
申请号:US18518430
申请日:2023-11-22
Applicant: NVIDIA Corporation
Inventor: Yu Zeng , Yogesh Balaji , Ting-Chun Wang , Xun Huang , Ming-Yu Liu
Abstract: Apparatuses, processors, computing systems, devices, non-transitory computer medium, and/or methods for using neural networks for generating multiple related images. In at least one embodiment, a processor includes circuitry to use one or more neural networks to generate several images, where each image includes a same object (e.g., same subject) and different backgrounds. For example, a processor including one or more circuits to use one or more neural networks to generate one or more objects (e.g., an animal, a vehicle, a person) within two or more different images (e.g., different backgrounds such as weather, season, environment) based, at least in part, on one or more indications (e.g., text prompts) by one or more users indicating content of at least one of the two or more different images (e.g., objects and/or backgrounds for each image in text such as adjectives and nouns) other than the one or more objects.
-
公开(公告)号:US20240161403A1
公开(公告)日:2024-05-16
申请号:US18232279
申请日:2023-08-09
Applicant: NVIDIA Corporation
Inventor: Chen-Hsuan Lin , Tsung-Yi Lin , Ming-Yu Liu , Sanja Fidler , Karsten Kreis , Luming Tang , Xiaohui Zeng , Jun Gao , Xun Huang , Towaki Takikawa
CPC classification number: G06T17/20 , G06T3/40 , G06T15/04 , G06T17/005 , G06T19/20
Abstract: Text-to-image generation generally refers to the process of generating an image from one or more text prompts input by a user. While artificial intelligence has been a valuable tool for text-to-image generation, current artificial intelligence-based solutions are more limited as it relates to text-to-3D content creation. For example, these solutions are oftentimes category-dependent, or synthesize 3D content at a low resolution. The present disclosure provides a process and architecture for high-resolution text-to-3D content creation.
-
公开(公告)号:US20230045076A1
公开(公告)日:2023-02-09
申请号:US17389113
申请日:2021-07-29
Applicant: Nvidia Corporation
Inventor: Xun Huang , Arun Mallya , Ting Wang , Ming-Yu Liu
Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon one or more input types.
-
公开(公告)号:US20240054609A1
公开(公告)日:2024-02-15
申请号:US18195784
申请日:2023-05-10
Applicant: NVIDIA Corporation
Inventor: Xun Huang , Ming-Yu Liu
CPC classification number: G06T5/50 , G06T7/10 , G06V10/82 , G06T2207/20221
Abstract: Apparatuses, systems, and techniques to generate images. In at least one embodiment, one or more neural networks are used to generate a panoramic image from a segmentation mask.
-
公开(公告)号:US20220237838A1
公开(公告)日:2022-07-28
申请号:US17159977
申请日:2021-01-27
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu , Xun Huang
Abstract: Apparatuses, systems, and techniques are presented to synthesize representations. In at least one embodiment, one or more neural networks are used to generate one or more representations of one or more objects based, at least in part, upon one or more structural features and one or more appearance features for the one or more objects.
-
公开(公告)号:US20220108417A1
公开(公告)日:2022-04-07
申请号:US17061041
申请日:2020-10-01
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu , Xun Huang
Abstract: Apparatuses, systems, and techniques are presented to generate images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon speech input received from one or more users.
-
公开(公告)号:US20200242736A1
公开(公告)日:2020-07-30
申请号:US16261395
申请日:2019-01-29
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu , Xun Huang , Tero Karras , Timo Aila , Jaakko Lehtinen
Abstract: A few-shot, unsupervised image-to-image translation (“FUNIT”) algorithm is disclosed that accepts as input images of previously-unseen target classes. These target classes are specified at inference time by only a few images, such as a single image or a pair of images, of an object of the target type. A FUNIT network can be trained using a data set containing images of many different object classes, in order to translate images from one class to another class by leveraging few input images of the target class. By learning to extract appearance patterns from the few input images for the translation task, the network learns a generalizable appearance pattern extractor that can be applied to images of unseen classes at translation time for a few-shot image-to-image translation task.
-
公开(公告)号:US20190279075A1
公开(公告)日:2019-09-12
申请号:US16279671
申请日:2019-02-19
Applicant: Nvidia Corporation
Inventor: Ming-Yu Liu , Xun Huang
Abstract: A source image is processed using an encoder network to determine a content code representative of a visual aspect of the source object represented in the source image. A target class is determined, which can correspond to an entire population of objects of a particular type. The user may specify specific objects within the target class, or a sampling can be done to select objects within the target class to use for the translation. Style codes for the selected target objects are determined that are representative of the appearance of those target objects. The target style codes are provided with the source content code as input to a translation network, which can use the codes to infer a set of images including representations of the selected target objects having the visual aspect determined from the source image.
-
-
-
-
-
-
-
-