Method for distributed type training adaptation and apparatus in deep learning framework and AI accelerator card

Invention Grant

US11714995B2 Method for distributed type training adaptation and apparatus in deep learning framework and AI accelerator card (Status: Active)

Patent Title: Method for distributed type training adaptation and apparatus in deep learning framework and AI accelerator card
Application No.: US17739205

Application Date: 2022-05-09
Publication No.: US11714995B2

Publication Date: 2023-08-01
Inventor: Hongsheng Wang, Hujun Bao, Wei Hua, Weiqiang Jia
Applicant: ZHEJIANG LAB
Applicant Address: CN Zhejiang
Assignee: ZHEJIANG LAB
Current Assignee: ZHEJIANG LAB
Current Assignee Address: CN Hangzhou
Agency: W&G Law Group
Priority: CN 2111487478.8 2021.12.08
Main IPC: G06N3/04
IPC: G06N3/04; G06F8/36; G06F9/48; G06F9/54

Abstract:

Disclosed are a method and an apparatus for distributed training adaptation in a deep learning framework and an AI accelerator card. The method includes the following steps:

S1: the deep learning framework supports single-card configuration of a newly added AI accelerator card, with the following sub-steps:
    S11: the deep learning framework supports the new hardware;
    S12: the deep learning framework supports a device thread of the new hardware;
    S13: the deep learning framework supports a memory operation of the new hardware; and
    S14: the deep learning framework supports an operator kernel function of the new hardware;
S2: the deep learning framework supports multi-card configuration of the newly added AI accelerator card;
S3: the deep learning framework supports tensor segmentation and multi-card distribution; and
S4: the deep learning framework supports multi-card collective communication on the newly added AI accelerator card.
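
As an illustration only, steps S3 and S4 can be pictured with a small pure-Python simulation. This is a minimal sketch and not the patented implementation: the "cards" are plain Python lists, and the function names (split_tensor, local_column_sums, all_reduce_sum, distributed_column_sums) are hypothetical names chosen for this example, not part of any framework or vendor API. The choice of a sum all-reduce is also an assumption, since the abstract only says "collective communication".

```python
from typing import List

Matrix = List[List[float]]


def split_tensor(rows: Matrix, num_cards: int) -> List[Matrix]:
    """S3 (sketch): split a 2-D tensor along its first axis into one shard per card."""
    base, extra = divmod(len(rows), num_cards)
    shards, start = [], 0
    for card in range(num_cards):
        # The first `extra` cards receive one additional row.
        size = base + (1 if card < extra else 0)
        shards.append(rows[start:start + size])
        start += size
    return shards


def local_column_sums(shard: Matrix, width: int) -> List[float]:
    """Per-card work: each simulated card reduces only its own shard."""
    sums = [0.0] * width
    for row in shard:
        for j, value in enumerate(row):
            sums[j] += value
    return sums


def all_reduce_sum(per_card: List[List[float]]) -> List[List[float]]:
    """S4 (sketch): after the collective, every card holds the same element-wise sum."""
    total = [sum(values) for values in zip(*per_card)]
    return [list(total) for _ in per_card]


def distributed_column_sums(rows: Matrix, num_cards: int) -> List[List[float]]:
    """Shard the tensor, compute partial results per card, then all-reduce them."""
    width = len(rows[0])
    shards = split_tensor(rows, num_cards)
    partials = [local_column_sums(shard, width) for shard in shards]
    return all_reduce_sum(partials)


if __name__ == "__main__":
    data = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    print(distributed_column_sums(data, num_cards=2))
```

On real hardware the all-reduce would be carried out by the accelerator vendor's collective communication library rather than by a Python loop; the simulation only shows the data flow between sharding and reduction.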

Public/Granted literature

US20230177312A1 METHOD FOR DISTRIBUTED TYPE TRAINING ADAPTATION AND APPARATUS IN DEEP LEARNING FRAMEWORK AND AI ACCELERATOR CARD
Public/Granted day: 2023-06-08

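IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06N	Computer systems based on specific computational models
G06N3/00	Computer systems based on biological models
G06N3/02	. Using neural network models
G06N3/04	.. Architecture, e.g. interconnection topology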