-
1.
公开(公告)号:EP4465214A1
公开(公告)日:2024-11-20
申请号:EP24176797.9
申请日:2024-05-17
Applicant: Lemon Inc.
Inventor: LIU, Tongping , XU, Wei , CHEN, Jianjun
Abstract: System and method of training a machine learning model on a plurality of devices in parallel are provided. The method includes performing a model profiling execution before a model normal execution, allocating (360) tensors of the model into a plurality of chunks based on profiling results from the model profiling execution, and performing the model normal execution on the plurality of devices in parallel to train or fine-tune the model.