-
公开(公告)号:US20240393943A1
公开(公告)日:2024-11-28
申请号:US18794436
申请日:2024-08-05
Applicant: Advanced Micro Devices, Inc.
Inventor: Harris Eleftherios Gasparakis
IPC: G06F3/06
Abstract: Generating optimization instructions for data processing pipelines is described. A pipeline optimization system computes resource usage information that describes memory and compute usage metrics during execution of each stage of the data processing pipeline. The system additionally generates data storage information that describes how data output by each pipeline stage is utilized by other stages of the pipeline. The pipeline optimization system then generates the optimization instructions to control how memory operations are performed for a specific data processing pipeline during execution. In implementations, the optimization instructions cause a memory system to discard data (e.g., invalidate cache entries) without copying the discarded data to another storage location after the data is no longer needed by the pipeline. The optimization instructions alternatively or additionally control at least one of evicting, writing-back, or prefetching data to minimize latency during pipeline execution.
-
公开(公告)号:US20240103719A1
公开(公告)日:2024-03-28
申请号:US17955286
申请日:2022-09-28
Applicant: Advanced Micro Devices, Inc.
Inventor: Harris Eleftherios Gasparakis
IPC: G06F3/06
CPC classification number: G06F3/0604 , G06F3/0655 , G06F3/0679
Abstract: Generating optimization instructions for data processing pipelines is described. A pipeline optimization system computes resource usage information that describes memory and compute usage metrics during execution of each stage of the data processing pipeline. The system additionally generates data storage information that describes how data output by each pipeline stage is utilized by other stages of the pipeline. The pipeline optimization system then generates the optimization instructions to control how memory operations are performed for a specific data processing pipeline during execution. In implementations, the optimization instructions cause a memory system to discard data (e.g., invalidate cache entries) without copying the discarded data to another storage location after the data is no longer needed by the pipeline. The optimization instructions alternatively or additionally control at least one of evicting, writing-back, or prefetching data to minimize latency during pipeline execution.
-
公开(公告)号:US12056352B2
公开(公告)日:2024-08-06
申请号:US17955286
申请日:2022-09-28
Applicant: Advanced Micro Devices, Inc.
Inventor: Harris Eleftherios Gasparakis
IPC: G06F3/06
CPC classification number: G06F3/0604 , G06F3/0655 , G06F3/0679
Abstract: Generating optimization instructions for data processing pipelines is described. A pipeline optimization system computes resource usage information that describes memory and compute usage metrics during execution of each stage of the data processing pipeline. The system additionally generates data storage information that describes how data output by each pipeline stage is utilized by other stages of the pipeline. The pipeline optimization system then generates the optimization instructions to control how memory operations are performed for a specific data processing pipeline during execution. In implementations, the optimization instructions cause a memory system to discard data (e.g., invalidate cache entries) without copying the discarded data to another storage location after the data is no longer needed by the pipeline. The optimization instructions alternatively or additionally control at least one of evicting, writing-back, or prefetching data to minimize latency during pipeline execution.
-
公开(公告)号:US20230409868A1
公开(公告)日:2023-12-21
申请号:US17844204
申请日:2022-06-20
Applicant: Advanced Micro Devices, Inc.
Inventor: Hai Xiao , Adam H Li , Harris Eleftherios Gasparakis
Abstract: Activation scaled clipping layers for neural networks are described. An activation scaled clipping layer processes an output of a neuron in a neural network using a scaling parameter and a clipping parameter. The scaling parameter defines how numerical values are amplified relative to zero. The clipping parameter specifies a numerical threshold that causes the neuron output to be expressed as a value defined by the numerical threshold if the neuron output satisfies the numerical threshold. In some implementations, the scaling parameter is linear and treats numbers within a numerical range as being equivalent, such that any number in the range is scaled by a defined magnitude, regardless of value. Alternatively, the scaling parameter is nonlinear, which causes the activation scaled clipping layer to amplify numbers within a range by different magnitudes. Each scaling and clipping parameter is learnable during training of a machine learning model implementing the neural network.
-
-
-