-
公开(公告)号:US20240411695A1
公开(公告)日:2024-12-12
申请号:US18739055
申请日:2024-06-10
Applicant: Apple Inc.
Inventor: Per H. Hammarlund , Eran Tamari , Lior Zimet , Sergio Kolor , Sergio Tota , Sagi Lahav , James Vash , Gaurav Garg , Jonathan M. Redshaw , Steven R. Hutsell , Harshavardhan Kaushikkar , Shawn M. Fukami
IPC: G06F12/0831 , G06F12/0811 , G06F12/0815 , G06F12/109 , G06F12/128 , G06F13/16 , G06F13/28 , G06F13/40 , G06F15/173 , G06F15/78
Abstract: A system including a plurality of processor cores, a plurality of graphics processing units, a plurality of peripheral circuits, and a plurality of memory controllers is configured to support scaling of the system using a unified memory architecture.
-
公开(公告)号:US20230350828A1
公开(公告)日:2023-11-02
申请号:US18309192
申请日:2023-04-28
Applicant: Apple Inc.
Inventor: Sergio Kolor , Sergio V. Tota , Tzach Zemer , Sagi Lahav , Jonathan M. Redshaw , Per H. Hammarlund , Eran Tamari , James Vash , Gaurav Garg , Lior Zimet , Harshavardhan Kaushikkar , Steven Fishwick , Steven R. Hutsell , Shawn M. Fukami
IPC: G06F15/173 , G06F13/40
CPC classification number: G06F13/4027 , G06F13/4022 , G06F15/17375 , G06F15/17381
Abstract: In an embodiment, a system on a chip (SOC) comprises a semiconductor die on which circuitry is formed, wherein the circuitry comprises a plurality of agents and a plurality of network switches coupled to the plurality of agents. The plurality of network switches are interconnected to form a plurality of physical and logically independent networks. A first network of the plurality of physically and logically independent networks is constructed according to a first topology and a second network of the plurality of physically and logically independent networks is constructed according to a second topology that is different from the first topology. For example, the first topology may a ring topology and the second topology may be a mesh topology. In an embodiment, coherency may be enforced on the first network and the second network may be a relaxed order network.
-
公开(公告)号:US20230053530A1
公开(公告)日:2023-02-23
申请号:US17821305
申请日:2022-08-22
Applicant: Apple Inc.
Inventor: Per H. Hammarlund , Eran Tamari , Lior Zimet , Sergio Kolor , Sergio V. Tota , Sagi Lahav , James Vash , Gaurav Garg , Jonathan M. Redshaw , Steven R. Hutsell , Harshavardhan Kaushikkar , Shawn M. Fukami
IPC: G06F15/173 , G06F15/78 , G06F13/16 , G06F13/40
Abstract: A system including a plurality of processor cores, a plurality of graphics processing units, a plurality of peripheral circuits, and a plurality of memory controllers is configured to support scaling of the system using a unified memory architecture.
-
公开(公告)号:US20220036637A1
公开(公告)日:2022-02-03
申请号:US17103317
申请日:2020-11-24
Applicant: Apple Inc.
Inventor: Ali Rabbani Rankouhi , Christopher A. Burns , Justin A. Hensley , Luca Iuliano , Jonathan M. Redshaw
Abstract: Disclosed techniques relate to grouping rays during traversal of a spatially-organized acceleration data structure (e.g., a bounding volume hierarchy) for ray intersection processing. The grouping may provide temporal locality for accesses to bounding region data. In some embodiments, ray intersect circuitry is configured to group rays based on the node of the data structure that they target next. The ray intersect circuitry may select one or more groups of rays for issuance each clock cycle, e.g., to bounding region test circuitry.
-
公开(公告)号:US12217350B2
公开(公告)日:2025-02-04
申请号:US17817742
申请日:2022-08-05
Applicant: Apple Inc.
Inventor: Ali Rabbani Rankouhi , Christopher A. Burns , Justin A. Hensley , Luca Iuliano , Jonathan M. Redshaw
IPC: G06T15/06 , G06F9/38 , G06F9/48 , G06F9/50 , G06F16/22 , G06F30/31 , G06T1/20 , G06T1/60 , G06T15/00 , G06T17/00 , G06Q10/101 , G06Q50/04 , G06T17/10 , G16H40/67
Abstract: Disclosed techniques relate to acceleration data structure for ray intersection testing. In some embodiments, storage circuitry stores node data for a spatially organized acceleration data structure, including to store the following node information for a given node: origin coordinates for the node and, for a given child node of multiple child nodes, child information that includes: quantized bounding region information for a bounding region corresponding to the child node, where the quantized bounding region information encodes bounding region coordinates as offsets relative to the origin coordinates. Traversal circuitry may traverse multiple nodes of the data structure and determine whether a ray intersects a bounding region indicated by given a node of the data structure based on the node information. Disclosed techniques may provide substantial improvements to performance, data size, and power consumption.
-
公开(公告)号:US12182037B2
公开(公告)日:2024-12-31
申请号:US18173500
申请日:2023-02-23
Applicant: Apple Inc.
Inventor: Jonathan M. Redshaw , Winnie W. Yeung , Benjiman L. Goodman , David K. Li , Zelin Zhang , Yoong Chert Foo
IPC: G06F12/126 , G06F12/0811 , G06F12/0891
Abstract: Techniques are disclosed relating to eviction control for cache lines that store register data. In some embodiments, memory hierarchy circuitry is configured to provide memory backing for register operand data in one or more cache circuits. Lock circuitry may control a first set of lock indicators for a set of registers for a first thread, including to assert one or more lock indicators for registers that are indicated, by decode circuitry, as being utilized by decoded instructions of the first thread. The lock circuitry may preserve register operand data in the one or more cache circuits, including to prevent eviction of a given cache line from a cache circuit based on an asserted lock indicator. The lock circuitry may clear the first set of lock indicators in response to a reset event. Disclosed techniques may advantageously retain relevant register information in the cache with limited control circuit area.
-
公开(公告)号:US20240394461A1
公开(公告)日:2024-11-28
申请号:US18791165
申请日:2024-07-31
Applicant: Apple Inc.
Inventor: Haim Hauzi , Eran Tamari , Per H. Hammarlund , Jonathan M. Redshaw , Alfredo Kostianovsky , Idan Nissel , Leonid Gitelman , Oren Betzalel , Dalia Haim , Lior Zimet
IPC: G06F30/398 , G03F1/70 , G06F30/392
Abstract: A chip design methodology and a set of integrated circuits that are taped out from a common design database are disclosed. The area of a full instance of the integrated circuit is defined, and one or more chop lines are defined to identify portions that will be removed for one or more partial instances. A variety of techniques and mechanisms are defined to permit the tape outs to occur from a common design database, so that the effort to tape out partial instances may be minimized beyond that to tape out the full instance.
-
公开(公告)号:US20230053664A1
公开(公告)日:2023-02-23
申请号:US17873694
申请日:2022-07-26
Applicant: Apple Inc.
Inventor: Haim Hauzi , Eran Tamari , Per H. Hammarlund , Jonathan M. Redshaw , Alfredo Kostianovsky , Idan Nissel , Leonid Gitelman , Oren Betzalel , Dalia R. Haim , Lior Zimet
IPC: G06F30/398 , G06F30/392 , G03F1/70
Abstract: A chip design methodology and a set of integrated circuits that are taped out from a common design database are disclosed. The area of a full instance of the integrated circuit is defined, and one or more chop lines are defined to identify portions that will be removed for one or more partial instances. A variety of techniques and mechanisms are defined to permit the tape outs to occur from a common design database, so that the effort to tape out partial instances may be minimized beyond that to tape out the full instance.
-
公开(公告)号:US11521343B2
公开(公告)日:2022-12-06
申请号:US17103462
申请日:2020-11-24
Applicant: Apple Inc.
Inventor: Terence M. Potter , Yoong Chert Foo , Ali Rabbani Rankouhi , Justin A. Hensley , Jonathan M. Redshaw
IPC: G06F15/16 , G06T15/06 , G06T15/00 , G06F9/48 , G06F9/50 , G06T1/20 , G06F16/22 , G06F30/31 , G06F9/38 , G06T1/60 , G06T17/10 , G16H40/67 , G06Q10/10 , G06Q50/04
Abstract: Disclosed techniques relate to memory space management for graphics processing. In some embodiments, first and second graphics cores are configured to execute instructions for multiple threadgroups. In some embodiments, the threads groups include a first threadgroup with multiple single-instruction multiple-data (SIMD) groups configured to execute a first shader program and a second threadgroup with multiple SIMD groups configured to execute a second, different shader program. Control circuitry may be configured to provide access to data stored in memory circuitry according to a shader memory space. The shader memory space may be accessible to threadgroups executed by the first graphics shader core, including the first and second threadgroups, but is not accessible to threadgroups executed by the second graphics shader core. Disclosed techniques may reduce latency, increase bandwidth available to the shader, reduce coherency cost, or any combination thereof.
-
公开(公告)号:US20220036639A1
公开(公告)日:2022-02-03
申请号:US17103433
申请日:2020-11-24
Applicant: Apple Inc.
Inventor: Ali Rabbani Rankouhi , Christopher A. Burns , Justin A. Hensley , Luca Iuliano , Jonathan M. Redshaw
Abstract: Disclosed techniques relate to ray intersection processing for ray tracing. In some embodiments, ray intersection circuitry traverses a spatially organized acceleration data structure and includes bounding region circuitry configured to test, in parallel, whether a ray intersects multiple different bounding regions indicated by a node of the data structure. Shader circuitry may execute a ray intersect instruction to invoke traversal by the ray intersect circuitry and the traversal may generate intersection results. The shader circuitry may shade intersected primitives based on the intersection results. Disclosed techniques that share processing between intersection circuitry and shader processors may improve performance, reduce power consumption, or both, relative to traditional techniques.
-
-
-
-
-
-
-
-
-