- Patent Title: Systems and methods for accelerating memory transfers and computation efficiency using a computation-informed partitioning of an on-chip data buffer and implementing computation-aware data transfer operations to the on-chip data buffer
-
Application No.: US17902985Application Date: 2022-09-05
-
Publication No.: US11714556B2Publication Date: 2023-08-01
- Inventor: Marian Petre , Aman Sikka , Nigel Drego , Veerbhan Kheterpal , Daniel Firu , Mrinalini Ravichandran
- Applicant: quadric.io, Inc.
- Applicant Address: US CA Burlingame
- Assignee: quadric.io, Inc.
- Current Assignee: quadric.io, Inc.
- Current Assignee Address: US CA Burlingame
- Agency: Chandler Scheitlin Alce PLLC
- Agent Padowithz Alce
- Main IPC: G06N3/08
- IPC: G06N3/08 ; G06F3/06 ; G06F13/16

Abstract:
Systems and methods for implementing accelerated memory transfers in an integrated circuit includes configuring a region of memory of an on-chip data buffer based on a neural network computation graph, wherein configuring the region of memory includes: partitioning the region of memory of the on-chip data buffer to include a first distinct sub-region of memory and a second distinct sub-region of memory; initializing a plurality of distinct memory transfer operations from the off-chip main memory to the on-chip data buffer; executing a first set of memory transfer operations that includes writing a first set of computational components to the first distinct sub-region of memory, and while executing, using the integrated circuit, a leading computation based on the first set of computational components, executing a second set of memory transfer operations to the second distinct sub-region of memory for an impending computation.
Public/Granted literature
Information query