Data transmission method for convolution operation, fetcher, and convolution operation apparatus

    公开(公告)号:US11797461B2

    公开(公告)日:2023-10-24

    申请号:US17858506

    申请日:2022-07-06

    CPC classification number: G06F13/1668 G06N3/08

    Abstract: A data transmission method for a convolution operation, and a convolution operation apparatus including a fetcher that includes a loader, at least one sender, a buffer controller, and a reuse buffer. The method includes loading, by the loader, input data of an input feature map according to a loading order, based on input data stored in the reuse buffer, a shape of a kernel to be used for a convolution operation, and two-dimensional (2D) zero-value information of weights of the kernel; storing, by the buffer controller, the loaded input data in the reuse buffer of an address cyclically assigned according to the loading order; and selecting, by each of the at least one sender, input data corresponding to each output data of a convolution operation among the input data stored in the reuse buffer, based on one-dimensional (1D) zero-value information of the weights, and outputting the selected input data.

    Data transmission method for convolution operation, fetcher, and convolution operation apparatus

    公开(公告)号:US11409675B2

    公开(公告)日:2022-08-09

    申请号:US17330229

    申请日:2021-05-25

    Abstract: A data transmission method for a convolution operation, and a convolution operation apparatus including a fetcher that includes a loader, at least one sender, a buffer controller, and a reuse buffer. The method includes loading, by the loader, input data of an input feature map according to a loading order, based on input data stored in the reuse buffer, a shape of a kernel to be used for a convolution operation, and two-dimensional (2D) zero-value information of weights of the kernel; storing, by the buffer controller, the loaded input data in the reuse buffer of an address cyclically assigned according to the loading order; and selecting, by each of the at least one sender, input data corresponding to each output data of a convolution operation among the input data stored in the reuse buffer, based on one-dimensional (1D) zero-value information of the weights, and outputting the selected input data.

    Apparatus and method of performing matrix multiplication operation of neural network

    公开(公告)号:US11899744B2

    公开(公告)日:2024-02-13

    申请号:US16851361

    申请日:2020-04-17

    CPC classification number: G06F17/16 G06F18/213 G06N3/08

    Abstract: A neural network apparatus for performing a matrix multiplication operation includes a memory having at least one program stored therein and a processor to perform one or more operations by executing the at least one program. The processor can determine whether to divide an initial weight in one of a column direction and a row direction according to whether a reshape operation and a transpose operation are performed before or after a matrix multiplication operation and generate division weights by dividing the initial weight by a head count in the determined direction. Also, the processor can generate intermediate feature maps by performing a matrix multiplication operation between the input feature map and the division weights and generate a final feature map based on the intermediate feature maps.

    Method and apparatus with neural network

    公开(公告)号:US11544549B2

    公开(公告)日:2023-01-03

    申请号:US16106703

    申请日:2018-08-21

    Abstract: A processor-implemented neural network method includes calculating individual update values for a weight assigned to a connection relationship between nodes included in a neural network; generating an accumulated update value by accumulating the individual update values in an accumulation buffer; and training the neural network by updating the weight using the accumulated update value in response to the accumulated update value being equal to or greater than a threshold value.

Patent Agency Ranking