Invention Grant
- Patent Title: Method for distributed type training adaptation and apparatus in deep learning framework and AI accelerator card
-
Application No.: US17739205Application Date: 2022-05-09
-
Publication No.: US11714995B2Publication Date: 2023-08-01
- Inventor: Hongsheng Wang , Hujun Bao , Wei Hua , Weiqiang Jia
- Applicant: ZHEJIANG LAB
- Applicant Address: CN Zhejiang
- Assignee: ZHEJIANG LAB
- Current Assignee: ZHEJIANG LAB
- Current Assignee Address: CN Hangzhou
- Agency: W&G Law Group
- Priority: CN 2111487478.8 2021.12.08
- Main IPC: G06N3/04
- IPC: G06N3/04 ; G06F8/36 ; G06F9/48 ; G06F9/54

Abstract:
Disclosed is a method for distributed type training adaptation and apparatus in a deep learning framework and an AI accelerator card. The method includes the following steps: S1: the deep learning framework supports single-card configuration in a newly added AI accelerator card, and sub-steps thereof are as follows: S11: the deep learning framework supports new hardware; S12: the deep learning framework supports a device thread of the new hardware; S13: the deep learning framework supports a memory operation of the new hardware; and S14: the deep learning framework supports an operator kernel function of the new hardware; S2: the deep learning framework supports multi-card configuration in the newly added AI accelerator card; S3: the deep learning framework supports tensor segmentation and multi-card distribution; and S4: the deep learning framework supports multi-card collective communication in the newly added AI accelerator card.
Public/Granted literature
Information query