Invention Grant
- Patent Title: Maximized memory throughput using cooperative thread arrays
- Patent Title (中): 使用协作线程数组最大化内存吞吐量
-
Application No.: US11748298Application Date: 2007-05-14
-
Publication No.: US07925860B1Publication Date: 2011-04-12
- Inventor: Norbert Juffa , Brett W. Coon
- Applicant: Norbert Juffa , Brett W. Coon
- Applicant Address: US CA Santa Clara
- Assignee: NVIDIA Corporation
- Current Assignee: NVIDIA Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F9/30
- IPC: G06F9/30

Abstract:
In parallel processing devices, for streaming computations, processing of each data element of the stream may not be computationally intensive and thus processing may take relatively small amounts of time to compute as compared to memory accesses times required to read the stream and write the results. Therefore, memory throughput often limits the performance of the streaming computation. Generally stated, provided are methods for achieving improved, optimized, or ultimately, maximized memory throughput in such memory-throughput-limited streaming computations. Streaming computation performance is maximized by improving the aggregate memory throughput across the plurality of processing elements and threads. High aggregate memory throughput is achieved by balancing processing loads between threads and groups of threads and a hardware memory interface coupled to the parallel processing devices.
Information query