Invention Grant
- Patent Title: Differential bit width neural architecture search
-
Application No.: US16356928Application Date: 2019-03-18
-
Publication No.: US11604960B2Publication Date: 2023-03-14
- Inventor: Kalin Ovtcharov , Eric S. Chung , Vahideh Akhlaghi , Ritchie Zhao
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Newport IP, LLC
- Agent Leonard J. Hope
- Main IPC: G06N3/063
- IPC: G06N3/063 ; G06N3/04 ; G06N3/084

Abstract:
Machine learning is utilized to learn an optimized quantization configuration for an artificial neural network (ANN). For example, an ANN can be utilized to learn an optimal bit width for quantizing weights for layers of the ANN. The ANN can also be utilized to learn an optimal bit width for quantizing activation values for the layers of the ANN. Once the bit widths have been learned, they can be utilized at inference time to improve the performance of the ANN by quantizing the weights and activation values of the layers of the ANN.
Public/Granted literature
- US20200302269A1 DIFFERENTIAL BIT WIDTH NEURAL ARCHITECTURE SEARCH Public/Granted day:2020-09-24
Information query