Invention Grant
- Patent Title: Memory layouts and conversion to improve neural network inference performance
-
Application No.: US16399390Application Date: 2019-04-30
-
Publication No.: US11645512B2Publication Date: 2023-05-09
- Inventor: Min Guo
- Applicant: Baidu USA LLC
- Applicant Address: US CA Sunnyvale
- Assignee: BAIDU USA LLC
- Current Assignee: BAIDU USA LLC
- Current Assignee Address: US CA Sunnyvale
- Agency: Womble Bond Dickinson (US) LLP
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N5/04

Abstract:
Memory layout and conversion are disclosed to improve neural network (NN) inference performance. For one example, a NN selects a memory layout for a neural network (NN) among a plurality of different memory layouts based on thresholds derived from performance simulations of the NN. The NN stores multi-dimensional NN kernel computation data using the selected memory layout during NN inference. The memory layouts to be selected can be a channel, height, width, and batches (CHWN) layout, a batches, height, width and channel (NHWC) layout, and a batches, channel, height and width (NCHW) layout. If the multi-dimensional NN kernel computation data is not in the selected memory layout, the NN transforms the multi-dimensional NN kernel computation data for the selected memory layout.
Public/Granted literature
- US20200349424A1 MEMORY LAYOUTS AND CONVERSION TO IMPROVE NEURAL NETWORK INFERENCE PERFORMANCE Public/Granted day:2020-11-05
Information query