-
公开(公告)号:US20240103879A1
公开(公告)日:2024-03-28
申请号:US17952270
申请日:2022-09-25
Applicant: Advanced Micro Devices, Inc.
Inventor: Bin He , Michael John Mantor , Brian Emberling , Liang Huang , Chao Liu
CPC classification number: G06F9/3887 , G06F9/3001 , G06F9/30043 , G06F9/30098
Abstract: Block data load with transpose techniques are described. In one example, an input is received, at a control unit, specifying an instruction to load a block of data to at least one memory module using a transpose operation. Responsive to the receiving the input by the control unit, the block of data is caused to be loaded to the at least one memory module by transposing the block of data to form a transposed block of data and storing the transposed block of data in the at least one memory.
-
公开(公告)号:US12265915B2
公开(公告)日:2025-04-01
申请号:US17138709
申请日:2020-12-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Chao Liu , Daniel Isamu Lowell , Wen Heng Chung , Jing Zhang
Abstract: A technique for manipulating a generic tensor is provided. The technique includes receiving a first request to perform a first operation on a generic tensor descriptor associated with the generic tensor, responsive to the first request, performing the first operation on the generic tensor descriptor, receiving a second request to perform a second operation on generic tensor raw data associated with the generic tensor, and responsive to the second request, performing the second operation on the generic tensor raw data, the performing the second operation including mapping a tensor coordinate specified by the second request to a memory address, the mapping including evaluating a delta function to determine an address delta value to add to a previously determined address for a previously processed tensor coordinate.
-
公开(公告)号:US12229570B2
公开(公告)日:2025-02-18
申请号:US17952270
申请日:2022-09-25
Applicant: Advanced Micro Devices, Inc.
Inventor: Bin He , Michael John Mantor , Brian Emberling , Liang Huang , Chao Liu
Abstract: Block data load with transpose techniques are described. In one example, an input is received, at a control unit, specifying an instruction to load a block of data to at least one memory module using a transpose operation. Responsive to the receiving the input by the control unit, the block of data is caused to be loaded to the at least one memory module by transposing the block of data to form a transposed block of data and storing the transposed block of data in the at least one memory.
-
公开(公告)号:US12190225B2
公开(公告)日:2025-01-07
申请号:US16779557
申请日:2020-01-31
Applicant: Advanced Micro Devices, Inc.
Inventor: Chao Liu , Daniel Isamu Lowell , Wen Heng Chung , Jing Zhang
Abstract: A technique for manipulating a generic tensor is provided. The technique includes receiving a first request to perform a first operation on a generic tensor descriptor associated with the generic tensor, responsive to the first request, performing the first operation on the generic tensor descriptor, receiving a second request to perform a second operation on generic tensor raw data associated with the generic tensor, and responsive to the second request, performing the second operation on the generic tensor raw data.
-
公开(公告)号:US20210117806A1
公开(公告)日:2021-04-22
申请号:US17138709
申请日:2020-12-30
Applicant: Advanced Micro Devices, Inc.
Inventor: Chao Liu , Daniel Isamu Lowell , Wen Heng Chung , Jing Zhang
Abstract: A technique for manipulating a generic tensor is provided. The technique includes receiving a first request to perform a first operation on a generic tensor descriptor associated with the generic tensor, responsive to the first request, performing the first operation on the generic tensor descriptor, receiving a second request to perform a second operation on generic tensor raw data associated with the generic tensor, and responsive to the second request, performing the second operation on the generic tensor raw data, the performing the second operation including mapping a tensor coordinate specified by the second request to a memory address, the mapping including evaluating a delta function to determine an address delta value to add to a previously determined address for a previously processed tensor coordinate.
-
-
-
-