Invention Grant
- Patent Title: Dynamically resizing minibatch in neural network execution
-
Application No.: US16362945Application Date: 2019-03-25
-
Publication No.: US11354573B2Publication Date: 2022-06-07
- Inventor: Swagath Venkataramani , Vijayalakshmi Srinivasan , Jungwook Choi
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Scully, Scott, Murphy & Presser, P.C.
- Agent Joseph Petrokaitis
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06N3/04

Abstract:
A minibatch in a neural network execution may be dynamically resized based on on-chip memory. For example, a size of the minibatch is configured such that the minibatch fits within on-chip memory. The size of the minibatch may be resized for a sequence of layers in the neural network execution. A next layer's execution can commence responsive to the resized minibatch being completed in a previous layer without having to wait for all of the minibatch to be completed in the previous layer.
Public/Granted literature
- US20200311536A1 DYNAMICALLY RESIZING MINIBATCH IN NEURAL NETWORK EXECUTION Public/Granted day:2020-10-01
Information query