Invention Grant
- Patent Title: Method for optimizing deep learning operator, device and storage medium
- Application No.: US17482316
- Application Date: 2021-09-22
- Publication No.: US11966451B2
- Publication Date: 2024-04-23
- Inventor: Bin Li
- Applicant: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.
- Applicant Address: CN Beijing
- Assignee: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.
- Current Assignee: BEIJING XIAOMI PINECONE ELECTRONICS CO., LTD.
- Current Assignee Address: CN Beijing
- Agency: COZEN O'CONNOR
- Priority: CN 2110221205.2 2021.02.26
- Main IPC: G06F18/21
- IPC: G06F18/21 ; G06N3/04 ; G06T1/60 ; G06V10/94

Abstract:
A method for optimizing a deep learning operator includes: in response to detecting target data in an L1 cache of an image processor, calling an image-object read method to read the target data from the L1 cache to the processor; performing a secondary quantization operation on the target data in the processor to obtain an operation result; and writing the operation result into a main memory of the image processor. The target data is fixed-point data obtained in advance by performing a quantization operation on data to be quantized. The data to be quantized is one of the following: floating-point data of an initial network layer of a neural network model, or fixed-point data output from the network layer preceding the current network layer.
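
The abstract does not give the arithmetic of the two quantization steps. As a rough illustration only, the sketch below assumes a conventional affine int8 scheme (a scale and a zero point) and models the data flow in plain Python: floating-point data is quantized once in advance, and the resulting fixed-point values are later re-scaled by a secondary quantization step. The function names `quantize` and `requantize`, the int8 range, and the scale values are all hypothetical and are not taken from the patent. The GPU-side parts (the L1 cache, the image-object read, the write to main memory) are not modeled.

```python
# Illustrative sketch only: an assumed affine int8 scheme, not the patented method.

def clamp(value, lo=-128, hi=127):
    """Saturate an integer to the assumed int8 range."""
    return max(lo, min(hi, value))


def quantize(values, scale, zero_point=0):
    """First quantization: floating-point data -> fixed-point (int8) data.

    Python's round() rounds halves to the nearest even integer.
    """
    return [clamp(round(v / scale) + zero_point) for v in values]


def requantize(q_values, in_scale, out_scale, in_zero_point=0, out_zero_point=0):
    """Secondary quantization: re-scale already-quantized fixed-point data.

    Each input is mapped back to its real value with the input scale,
    then quantized again with the output scale and saturated to int8.
    """
    result = []
    for q in q_values:
        real = (q - in_zero_point) * in_scale
        result.append(clamp(round(real / out_scale) + out_zero_point))
    return result


# Quantize in advance (stands in for the "target data" held in the cache).
target_data = quantize([0.5, -1.0, 2.0], scale=0.5)

# Secondary quantization on the fixed-point data; large values saturate.
operation_result = requantize([4, -3, 100], in_scale=0.5, out_scale=0.25)
```

With these assumed scales, `target_data` is `[1, -2, 4]` and `operation_result` is `[8, -6, 127]`; the last value saturates because 200 exceeds the int8 maximum.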
Public/Granted literature
- US20220277170A1 METHOD FOR OPTIMIZING DEEP LEARNING OPERATOR, DEVICE AND STORAGE MEDIUM (Public/Granted day: 2022-09-01)