- Patent Title: Maximizing resource utilization of neural network computing system
-
Application No.: US16358547Application Date: 2019-03-19
-
Publication No.: US11609792B2Publication Date: 2023-03-21
- Inventor: Lingjie Xu , Wei Wei
- Applicant: ALIBABA GROUP HOLDING LIMITED
- Applicant Address: KY Grand Cayman
- Assignee: ALIBABA GROUP HOLDING LIMITED
- Current Assignee: ALIBABA GROUP HOLDING LIMITED
- Current Assignee Address: KY Grand Cayman
- Agency: Finnegan, Henderson, Farabow, Garrett & Dunner, LLP
- Main IPC: G06F9/50
- IPC: G06F9/50 ; G06N3/04 ; G06F9/48

Abstract:
The present disclosure relates to a method for allocating resources of an accelerator to two or more neural networks for execution. The two or more neural networks may include a first neural network and a second neural network. The method comprises analyzing workloads of the first neural network and the second neural network, wherein the first neural network and second neural network each includes multiple computational layers, evaluating computational resources of the accelerator for executing each computational layer of the first and second neural networks, and scheduling computational resources of the accelerator to execute one computational layer of the multiple computation layers of the first neural network and to execute one or more computational layers of the multiple computational layers of the second neural network.
Public/Granted literature
- US20200301739A1 MAXIMIZING RESOURCE UTILIZATION OF NEURAL NETWORK COMPUTING SYSTEM Public/Granted day:2020-09-24
Information query