Memory layouts and conversion to improve neural network inference performance

Invention Grant

US11645512B2 Memory layouts and conversion to improve neural network inference performance 有权

Please log in to see more content

Patent Title: Memory layouts and conversion to improve neural network inference performance
Application No.: US16399390

Application Date: 2019-04-30
Publication No.: US11645512B2

Publication Date: 2023-05-09
Inventor: Min Guo
Applicant: Baidu USA LLC
Applicant Address: US CA Sunnyvale
Assignee: BAIDU USA LLC
Current Assignee: BAIDU USA LLC
Current Assignee Address: US CA Sunnyvale
Agency: Womble Bond Dickinson (US) LLP
Main IPC: G06N3/08
IPC: G06N3/08 ; G06N5/04

Memory layouts and conversion to improve neural network inference performance

Abstract:

Memory layout and conversion are disclosed to improve neural network (NN) inference performance. For one example, a NN selects a memory layout for a neural network (NN) among a plurality of different memory layouts based on thresholds derived from performance simulations of the NN. The NN stores multi-dimensional NN kernel computation data using the selected memory layout during NN inference. The memory layouts to be selected can be a channel, height, width, and batches (CHWN) layout, a batches, height, width and channel (NHWC) layout, and a batches, channel, height and width (NCHW) layout. If the multi-dimensional NN kernel computation data is not in the selected memory layout, the NN transforms the multi-dimensional NN kernel computation data for the selected memory layout.

Public/Granted literature

US20200349424A1 MEMORY LAYOUTS AND CONVERSION TO IMPROVE NEURAL NETWORK INFERENCE PERFORMANCE Public/Granted day:2020-11-05

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N3/00	基于生物学模型的计算机系统
G06N3/02	.采用神经网络模型
G06N3/08	..学习方法