Invention Grant
- Patent Title: Quantizing autoencoders in a neural network
-
Application No.: US16282210Application Date: 2019-02-21
-
Publication No.: US11977388B2Publication Date: 2024-05-07
- Inventor: Jon Hasselgren , Jacob Munkberg
- Applicant: NVIDIA Corporation
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Davis Wright Tremaine LLP
- Main IPC: G06N3/088
- IPC: G06N3/088 ; G05B13/02 ; G05D1/00 ; G06N3/02 ; G06N3/04 ; G06N3/043 ; G06N3/045

Abstract:
The performance of a neural network is improved by applying quantization to data at various points in the network. In an embodiment, a neural network includes two paths. A quantization is applied to each path, such that when an output from each path is combined, further quantization is not required. In an embodiment, the neural network is an autoencoder that includes at least one skip connection. In an embodiment, the system determines a set of quantization parameters based on the characteristics of the data in the primary path and in the skip connection, such that both network paths produce output data in the same fixed point format. As a result, the data from both network paths can be combined without requiring an additional quantization.
Public/Granted literature
- US20200272162A1 QUANTIZING AUTOENCODERS IN A NEURAL NETWORK Public/Granted day:2020-08-27
Information query