-
Publication number: US20240253217A1
Publication date: 2024-08-01
Application number: US18538248
Application date: 2023-12-13
Applicant: NVIDIA Corporation
Inventor: Arash Vahdat, Hongxu Yin, Jan Kautz, Jiaming Song, Ming-Yu Liu, Morteza Mardani, Qinsheng Zhang
IPC: B25J9/16
CPC classification number: B25J9/163, B25J9/1664, B25J9/1697
Abstract: Apparatuses, systems, and techniques to calculate a combined loss value based on applying one or more loss functions to a plurality of samples generated by a diffusion model to update the samples to determine synthesized motions of one or more objects.
-
Publication number: US12047595B2
Publication date: 2024-07-23
Application number: US17955734
Application date: 2022-09-29
Applicant: Nvidia Corporation
Inventor: Aurobinda Maharana, Arun Mallya, Ming-Yu Liu, Abhijit Patait
Abstract: Systems and methods herein address reference frame selection in video streaming applications using one or more processing units to decode a frame of an encoded video stream that uses an inter-frame depicting an object and an intra-frame depicting the object, the intra-frame being included in a set of intra-frames based at least in part on at least one attribute of the object as depicted in the intra-frame being different from the at least one attribute of the object as depicted in other intra-frames of the set of intra-frames.
-
Publication number: US20240161403A1
Publication date: 2024-05-16
Application number: US18232279
Application date: 2023-08-09
Applicant: NVIDIA Corporation
Inventor: Chen-Hsuan Lin, Tsung-Yi Lin, Ming-Yu Liu, Sanja Fidler, Karsten Kreis, Luming Tang, Xiaohui Zeng, Jun Gao, Xun Huang, Towaki Takikawa
CPC classification number: G06T17/20, G06T3/40, G06T15/04, G06T17/005, G06T19/20
Abstract: Text-to-image generation generally refers to the process of generating an image from one or more text prompts input by a user. While artificial intelligence has been a valuable tool for text-to-image generation, current artificial intelligence-based solutions are more limited as it relates to text-to-3D content creation. For example, these solutions are oftentimes category-dependent, or synthesize 3D content at a low resolution. The present disclosure provides a process and architecture for high-resolution text-to-3D content creation.
-
Publication number: US11625613B2
Publication date: 2023-04-11
Application number: US17143516
Application date: 2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.
-
Publication number: US20230045076A1
Publication date: 2023-02-09
Application number: US17389113
Application date: 2021-07-29
Applicant: Nvidia Corporation
Inventor: Xun Huang, Arun Mallya, Ting Wang, Ming-Yu Liu
Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images based, at least in part, upon one or more input types.
-
Publication number: US11496773B2
Publication date: 2022-11-08
Application number: US17352064
Application date: 2021-06-18
Applicant: NVIDIA Corporation
Inventor: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
IPC: H04N19/85, H04N19/91, H04N19/436, H04N19/46
Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.
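The residual round trip described in this abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: a simple quantizer stands in for the lossy video codec, `zlib` stands in for the encode, binarize, and compress stage, and all function names are hypothetical. Because this toy residual is stored losslessly, the destination recovers the original exactly, whereas the abstract only claims an improvement in quality.

```python
import zlib

STEP = 16  # hypothetical quantization step standing in for a lossy codec


def lossy_compress(pixels):
    # Toy lossy codec: keep only the quantization index of each 8-bit sample.
    return bytes(p // STEP for p in pixels)


def lossy_decompress(data):
    # Reconstruct each sample at the midpoint of its quantization bin.
    return [q * STEP + STEP // 2 for q in data]


def encode_residual(original, decoded):
    # Residual = what the lossy round trip lost. It lies in [-8, 7] here,
    # so offsetting by 128 makes each value fit in one byte before zlib.
    residual = bytes(o - d + 128 for o, d in zip(original, decoded))
    return zlib.compress(residual)


def decode_residual(payload):
    # Undo the zlib compression and the +128 offset.
    return [b - 128 for b in zlib.decompress(payload)]


def sender(original):
    # Compress, decompress locally, and compare against the original
    # to obtain the residual that is sent alongside the compressed data.
    compressed = lossy_compress(original)
    decoded = lossy_decompress(compressed)
    return compressed, encode_residual(original, decoded)


def receiver(compressed, residual_payload):
    # Decompress, then add the restored residual back in.
    decoded = lossy_decompress(compressed)
    residual = decode_residual(residual_payload)
    return [d + r for d, r in zip(decoded, residual)]


frame = [0, 17, 34, 120, 200, 255]  # made-up 8-bit samples
compressed, payload = sender(frame)
restored = receiver(compressed, payload)
```

In this sketch `lossy_decompress(compressed)` alone differs from `frame`, while `restored` matches it, which is the role the residual plays at the destination.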
-
Publication number: US20220012536A1
Publication date: 2022-01-13
Application number: US17483688
Application date: 2021-09-23
Applicant: NVIDIA Corporation
Inventor: Ting-Chun Wang, Ming-Yu Liu, Bryan Christopher Catanzaro, Jan Kautz, Andrew J. Tao
Abstract: A method, computer readable medium, and system are disclosed for creating an image utilizing a map representing different classes of specific pixels within a scene. One or more computing systems use the map to create a preliminary image. This preliminary image is then compared to an original image that was used to create the map. A determination is made whether the preliminary image matches the original image, and results of the determination are used to adjust the computing systems that created the preliminary image, which improves a performance of such computing systems. The adjusted computing systems are then used to create images based on different input maps representing various object classes of specific pixels within a scene.
-
Publication number: US20210358164A1
Publication date: 2021-11-18
Application number: US16875748
Application date: 2020-05-15
Applicant: NVIDIA Corporation
Inventor: Ming-Yu Liu, Kuniaki Saito
Abstract: Apparatuses, systems, and techniques to facilitate application of a style, for which one or more neural networks have not been trained by a training framework, from one image to content of another image. In at least one embodiment, a styled output image is generated by one or more neural networks based on a style contained in a style image and content of a content image where said one or more neural networks have not been trained by a training framework on said style.
-
Publication number: US11082720B2
Publication date: 2021-08-03
Application number: US16191174
Application date: 2018-11-14
Applicant: NVIDIA Corporation
Inventor: Yi-Hsuan Tsai, Ming-Yu Liu, Deqing Sun, Ming-Hsuan Yang, Jan Kautz
IPC: H04N19/85, H04N19/91, H04N19/436, H04N19/46
Abstract: A method, computer readable medium, and system are disclosed for identifying residual video data. This data describes data that is lost during a compression of original video data. For example, the original video data may be compressed and then decompressed, and this result may be compared to the original video data to determine the residual video data. This residual video data is transformed into a smaller format by means of encoding, binarizing, and compressing, and is sent to a destination. At the destination, the residual video data is transformed back into its original format and is used during the decompression of the compressed original video data to improve a quality of the decompressed original video data.
-
Publication number: US20210150354A1
Publication date: 2021-05-20
Application number: US17143608
Application date: 2021-01-07
Applicant: NVIDIA Corporation
Inventor: Tero Tapani Karras, Samuli Matias Laine, David Patrick Luebke, Jaakko T. Lehtinen, Miika Samuli Aittala, Timo Oskari Aila, Ming-Yu Liu, Arun Mohanray Mallya, Ting-Chun Wang
Abstract: A latent code defined in an input space is processed by a mapping neural network to produce an intermediate latent code defined in an intermediate latent space. The intermediate latent code may be used as an appearance vector that is processed by a synthesis neural network to generate an image. The appearance vector is a compressed encoding of data, such as video frames including a person's face, audio, and other data. Captured images may be converted into appearance vectors at a local device and transmitted to a remote device using much less bandwidth compared with transmitting the captured images. A synthesis neural network at the remote device reconstructs the images for display.