Method and apparatus for accelerating convolutional neural network
Abstract:
A method and apparatus for accelerating a convolutional neural network. The method comprises: splitting a weight matrix of a convolutional layer into a plurality of weight segments by rows, and caching the weight segments in respective calculation units of a calculation unit array (step 301); reading a plurality of input data streams corresponding to the weight segments and feeding them in parallel into the calculation units (step 302), the input data streams being formed by splicing rows of data from an input feature map of the convolutional layer; and, within each calculation unit, performing a sliding-window operation and multiply-accumulate computations on the input data stream using the cached weight segment, so as to obtain an output feature map of the convolutional layer (step 303). With this method, the data entering each row of calculation units is read continuously as whole rows rather than across rows or columns, so no special memory layout is required, convolutions of different kernel sizes are supported, and an Im2col transformation does not need to be implemented, thereby reducing complexity.
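The sketch below is a minimal software illustration of the row-split scheme the abstract describes: each kernel row acts as a cached weight segment for one calculation unit, which slides a window over a contiguous input row and accumulates partial sums into the output feature map. It is an assumption for clarity only, not the patented hardware implementation; the function name conv2d_row_split, the NumPy usage, and the reference check are introduced here for illustration.

```python
import numpy as np

def conv2d_row_split(ifm, kernel):
    """Valid 2-D convolution computed by splitting the kernel by rows.

    ifm    : (H, W) input feature map
    kernel : (K, K) weight matrix
    Returns: (H-K+1, W-K+1) output feature map
    """
    H, W = ifm.shape
    K, _ = kernel.shape
    out_h, out_w = H - K + 1, W - K + 1
    ofm = np.zeros((out_h, out_w))

    # One "calculation unit" per kernel row: unit r caches weight segment
    # kernel[r] and consumes input rows r, r+1, ... as contiguous streams.
    for r in range(K):
        w_seg = kernel[r]                      # cached weight segment
        for out_row in range(out_h):
            row_stream = ifm[out_row + r]      # a whole input row, read contiguously
            for out_col in range(out_w):
                # sliding window + multiply-accumulate inside the unit;
                # partial sums from the K units accumulate into one output pixel
                window = row_stream[out_col:out_col + K]
                ofm[out_row, out_col] += np.dot(window, w_seg)
    return ofm

# Usage: compare against a direct (Im2col-free) reference convolution.
rng = np.random.default_rng(0)
x = rng.standard_normal((6, 6))
k = rng.standard_normal((3, 3))
ref = np.array([[np.sum(x[i:i + 3, j:j + 3] * k) for j in range(4)]
                for i in range(4)])
assert np.allclose(conv2d_row_split(x, k), ref)
```

Note that each unit only ever reads one input row sequentially, which is the property the abstract highlights: no cross-row or cross-column access pattern and no Im2col rearrangement is needed.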