- Patent Title: Dynamic multi-layer execution for artificial intelligence modeling
-
Application No.: US16588779Application Date: 2019-09-30
-
Publication No.: US11354579B2Publication Date: 2022-06-07
- Inventor: Bharadwaj Pudipeddi , Marc Tremblay , Sujeeth Subramanya Bharadwaj , Jinwen Xi , Maral Mesmakhosroshahi
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Fiala & Weaver P.L.L.C.
- Main IPC: G06F15/16
- IPC: G06F15/16 ; G06N3/10 ; H04L67/10 ; G06N3/08

Abstract:
Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. This paradigm of executing one portion of the AI model at a time allows for dynamic execution of the large AI model.
Public/Granted literature
- US20210019634A1 DYNAMIC MULTI-LAYER EXECUTION FOR ARTIFICIAL INTELLIGENCE MODELING Public/Granted day:2021-01-21
Information query