Invention Grant
- Patent Title: Dynamic sequencing of data partitions for optimizing memory utilization and performance of neural networks
- Application No.: US17583499
- Application Date: 2022-01-25
- Publication No.: US11722147B2
- Publication Date: 2023-08-08
- Inventors: Kent D. Cedola, Larry Marvin Wall, Boris Bobrov, George Petre, Chad Balling McBride, Amol Ashok Ambardekar
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: Redmond, WA, US
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee Address: Redmond, WA, US
- Agency: Newport IP, LLC
- Agent: Han K. Gim
- Main IPC: H03M7/30
- IPC: H03M7/30; G06N3/04; G06N3/063; G06F12/0862; G06F9/46; G06F1/324; G06F3/06; G06F9/38; G06F12/08; G06F12/10; G06F15/80; G06F17/15; G06N3/049; G06N3/06; G06N3/08; G06N3/10; H04L45/02; H04L67/02; G06F9/30; H04L67/1001; G06F9/48; G06F12/02; G06N3/045; G06N3/065; G06F13/16; G06F1/3234; G06F13/28; H03M7/46; H04L45/50

Abstract:
Optimized memory usage and management are crucial to the overall performance of a neural network (NN) or deep neural network (DNN) computing environment. Using various characteristics of the input data dimensions, an apportionment sequence is calculated for the input data to be processed by the NN or DNN that optimizes the efficient use of the local and external memory components. The apportionment sequence can describe how to parcel the input data (and its associated processing parameters, e.g., processing weights) into one or more portions, as well as how such portions of input data (and their associated processing parameters) are passed between the local memory, external memory, and processing unit components of the NN or DNN. Additionally, the apportionment sequence can include instructions to store generated output data in the local and/or external memory components so as to optimize the efficient use of those components.
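To make the abstract's notion of an apportionment sequence concrete, the sketch below shows one minimal, hypothetical way such a plan could be computed: split the input into portions sized so that each portion's input data and weights fit in a local memory budget, keeping the output locally when it also fits and otherwise spilling it to external memory. The function name `plan_apportionment` and every parameter (row counts, byte sizes, `local_capacity`) are illustrative assumptions, not the patented method or any claimed algorithm.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Portion:
    """One apportioned slice of the input data and its parameters (illustrative only)."""
    row_start: int          # first input row covered by this portion
    row_count: int          # number of rows processed in this portion
    input_bytes: int        # bytes of input data staged in local memory
    weight_bytes: int       # bytes of processing weights staged in local memory
    output_to_local: bool   # whether the generated output is kept in local memory

def plan_apportionment(num_rows: int,
                       bytes_per_row: int,
                       weight_bytes: int,
                       output_bytes_per_row: int,
                       local_capacity: int) -> List[Portion]:
    """Split the input into portions whose input (and, when possible, output)
    fits in local memory alongside the weights; anything that does not fit is
    assumed to reside in external memory in this simple model."""
    if weight_bytes >= local_capacity:
        raise ValueError("weights alone exceed local memory in this simple model")

    budget = local_capacity - weight_bytes

    # Rows per portion if the portion's output is also kept locally.
    rows_local_output = budget // (bytes_per_row + output_bytes_per_row)

    # Fallback: only the input stays local; output is written to external memory.
    rows_input_only = budget // bytes_per_row
    if rows_input_only == 0:
        raise ValueError("even a single row does not fit next to the weights")

    keep_output_local = rows_local_output > 0
    rows_per_portion = rows_local_output if keep_output_local else rows_input_only

    sequence: List[Portion] = []
    start = 0
    while start < num_rows:
        count = min(rows_per_portion, num_rows - start)
        sequence.append(Portion(
            row_start=start,
            row_count=count,
            input_bytes=count * bytes_per_row,
            weight_bytes=weight_bytes,
            output_to_local=keep_output_local,
        ))
        start += count
    return sequence

if __name__ == "__main__":
    # Example: 10,000 input rows of 512 B, 64 KiB of weights,
    # 256 B of output per row, and 1 MiB of local memory.
    plan = plan_apportionment(10_000, 512, 64 * 1024, 256, 1024 * 1024)
    print(f"{len(plan)} portions; first portion covers {plan[0].row_count} rows, "
          f"output kept locally: {plan[0].output_to_local}")
```

Under these example numbers the planner keeps each portion's output local (1,280 rows per portion, 8 portions in total); shrinking `local_capacity` below what the output requires flips the plan to spilling output to external memory, which mirrors the abstract's point that the sequence governs both where input portions are staged and where generated output is stored.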