Processing device comprising control bus

    Publication Number: US11681642B2

    Publication Date: 2023-06-20

    Application Number: US17328143

    Application Date: 2021-05-24

    CPC classification number: G06F13/374 G06F13/364 G06F15/17375

    Abstract: A device comprising: a control bus; a plurality of requesting circuits, each accessible on the control bus, wherein each of the plurality of requesting circuits is operable to dispatch read or write requests to the control bus for delivery to at least one of a plurality of receiving circuits; and the plurality of receiving circuits, each accessible on the control bus and operable to receive requests from the control bus and service the requests by providing at least one of read or write access to storage associated with the respective receiving circuit, wherein the control bus provides a ring path configured to support the requests in circulation in the ring path, and wherein the control bus is configured to propagate each of at least some of the requests at least until those requests have been serviced by at least one of the receiving circuits.
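
    A minimal sketch of the circulation behaviour described above, assuming a software model of the ring: a request hops from node to node and keeps going until some receiving circuit services it. All names and the address-range ownership scheme are illustrative, not taken from the patent.

        # Toy model of a ring-path control bus: a request circulates around the
        # ring until a receiving circuit owning the addressed storage services it.
        class RingNode:
            def __init__(self, name, addr_range):
                self.name = name
                self.addr_range = addr_range      # storage addresses this circuit owns
                self.storage = {}

            def try_service(self, req):
                if req["addr"] not in self.addr_range:
                    return False                  # not ours; keep the request circulating
                if req["op"] == "write":
                    self.storage[req["addr"]] = req["value"]
                else:                             # "read"
                    req["result"] = self.storage.get(req["addr"], 0)
                return True

        def circulate(ring, req, start=0):
            """Propagate req hop by hop around the ring until it is serviced."""
            i = start
            while True:
                node = ring[i % len(ring)]
                if node.try_service(req):
                    return node.name              # the circuit that serviced the request
                i += 1

        ring = [RingNode("rc0", range(0, 16)), RingNode("rc1", range(16, 32))]
        serviced_by = circulate(ring, {"op": "write", "addr": 20, "value": 7})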

    Terminating Distributed Trusted Execution Environment via Confirmation Messages

    Publication Number: US20230014066A1

    Publication Date: 2023-01-19

    Application Number: US17305710

    Application Date: 2021-07-13

    Abstract: A method for securely terminating a distributed trusted execution environment (TEE) spanning a plurality of work accelerators. After wiping sensitive data from the memory of its accelerator, a root of trust for each accelerator is configured to receive confirmation that the data has been wiped from the processor memory in relevant other accelerators prior to moving on to the next stage at which the TEE on its associated accelerator is terminated. Since the data has been wiped from the other accelerators, even if a third party were to inject malicious code into the accelerator, they would be unable to read out the secret data from those other accelerators. In this way, a mechanism is provided for ensuring that when the distributed TEE is terminated, malicious third parties are unable to read out confidential data from the accelerators.
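
    A simplified sketch of the confirmation-message flow described above, assuming an in-process message model: each root of trust wipes its accelerator's memory, notifies its peers, and only terminates its local TEE once every peer has confirmed its own wipe. Class and method names are invented for illustration.

        # Each root of trust (RoT) wipes local sensitive data, broadcasts a
        # confirmation, and terminates its TEE only after hearing from all peers.
        class RootOfTrust:
            def __init__(self, accel_id, peer_ids):
                self.accel_id = accel_id
                self.peer_ids = set(peer_ids)
                self.confirmations = set()
                self.memory = {"secret": b"confidential"}
                self.tee_active = True

            def wipe_and_confirm(self, network):
                self.memory.clear()                      # stage 1: wipe sensitive data
                for peer in self.peer_ids:
                    network.send(peer, self.accel_id)    # tell peers the wipe is done

            def on_confirmation(self, from_id):
                self.confirmations.add(from_id)
                if self.confirmations == self.peer_ids:
                    self.tee_active = False              # stage 2: safe to terminate TEE

        class Network:
            def __init__(self):
                self.nodes = {}

            def send(self, to_id, from_id):
                self.nodes[to_id].on_confirmation(from_id)

        net = Network()
        rots = [RootOfTrust(i, [j for j in range(3) if j != i]) for i in range(3)]
        net.nodes = {r.accel_id: r for r in rots}
        for r in rots:
            r.wipe_and_confirm(net)
        assert all(not r.tee_active for r in rots)       # all TEEs terminated safely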

    Booting tiles of processing units

    Publication Number: US11507386B2

    Publication Date: 2022-11-22

    Application Number: US16527562

    Application Date: 2019-07-31

    Abstract: A processing system comprises a first subsystem comprising at least one host processor and one or more storage units, and a second subsystem comprising at least one second processor. Each second processor comprises a plurality of tiles. Each tile comprises a processing unit and memory. At least one storage unit stores bootloader code for each of first and second subsets of the plurality of tiles of at least one second processor. The first subsystem writes bootloader code to each of the first subset of tiles of the at least one second processor. At least one of the first subset of tiles requests at least one of the storage units to return the bootloader code to the second subset of the plurality of tiles. Each tile to which the bootloader code is written retrieves boot code from the storage unit and then runs said boot code.
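
    A simplified model of the two-stage boot described above, with invented names: the host writes bootloader code directly to a first subset of tiles, a tile in that subset asks a storage unit to deliver the bootloader to the remaining tiles, and every tile holding the bootloader then fetches and runs its boot code.

        class StorageUnit:
            def __init__(self, bootloader, boot_code):
                self.bootloader = bootloader
                self.boot_code = boot_code

            def push_bootloader(self, tiles):
                for t in tiles:                          # deliver bootloader on request
                    t.memory["bootloader"] = self.bootloader

        class Tile:
            def __init__(self, tile_id):
                self.tile_id = tile_id
                self.memory = {}
                self.booted = False

            def run_bootloader(self, storage):
                # The bootloader retrieves the full boot image and runs it.
                self.memory["boot_code"] = storage.boot_code
                self.booted = True

        storage = StorageUnit(bootloader=b"ldr", boot_code=b"boot")
        first_subset = [Tile(i) for i in range(2)]
        second_subset = [Tile(i) for i in range(2, 6)]

        # Phase 1: the host subsystem writes the bootloader to the first subset.
        for t in first_subset:
            t.memory["bootloader"] = storage.bootloader
        # Phase 2: a tile in the first subset requests delivery to the second subset.
        storage.push_bootloader(second_subset)
        # Every tile holding the bootloader retrieves boot code and runs it.
        for t in first_subset + second_subset:
            t.run_bootloader(storage)
        assert all(t.booted for t in first_subset + second_subset)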

    Synchronization amongst processor tiles

    Publication Number: US11023290B2

    Publication Date: 2021-06-01

    Application Number: US15885972

    Application Date: 2018-02-01

    Abstract: A processing system comprising an arrangement of tiles and an interconnect between the tiles. The interconnect comprises synchronization logic for coordinating a barrier synchronization to be performed between a group of the tiles. The instruction set comprises a synchronization instruction taking an operand which selects one of a plurality of available modes each specifying a different membership of the group. Execution of the synchronization instruction causes a synchronization request to be transmitted from the respective tile to the synchronization logic, and instruction issue to be suspended on the respective tile pending a synchronization acknowledgement being received back from the synchronization logic. In response to receiving the synchronization request from all the tiles in the group as specified by the operand of the synchronization instruction, the synchronization logic returns the synchronization acknowledgment to the tiles in the specified group.
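
    A sketch of the barrier behaviour, assuming a simple software stand-in for the synchronization logic: a tile executing the synchronization instruction sends a request naming the group selected by the mode operand and stalls; the logic returns an acknowledgement only once every tile in that group has requested. Group names are invented.

        # Stand-in for the interconnect's synchronization logic.
        class SyncLogic:
            def __init__(self, groups):
                self.groups = groups                     # mode -> set of member tile ids
                self.pending = {mode: set() for mode in groups}

            def request(self, tile_id, mode):
                """Called when a tile executes SYNC; the tile stalls until acked."""
                self.pending[mode].add(tile_id)
                if self.pending[mode] == self.groups[mode]:
                    acked = self.pending[mode]
                    self.pending[mode] = set()
                    return acked                         # ack released to every member
                return None                              # requester keeps instruction issue suspended

        sync = SyncLogic({"all_tiles": {0, 1, 2, 3}, "even_tiles": {0, 2}})
        assert sync.request(0, "even_tiles") is None     # tile 0 stalls, waiting for tile 2
        assert sync.request(2, "even_tiles") == {0, 2}   # both tiles now receive the ack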

    Synchronization in a multi-tile processing array

    Publication Number: US10963003B2

    Publication Date: 2021-03-30

    Application Number: US16165978

    Application Date: 2018-10-19

    Abstract: The invention relates to a computer comprising: a plurality of processing units each having instruction storage holding a local program, an execution unit executing the local program, data storage for holding data; an input interface with a set of input wires, and an output interface with a set of output wires; a switching fabric connected to each of the processing units by the respective set of output wires and connectable to each of the processing units by the respective input wires via switching circuitry controllable by each processing unit; a synchronisation module operable to generate a synchronisation signal to control the computer to switch between a compute phase and an exchange phase, wherein the processing units are configured to execute their local programs according to a common clock, the local programs being such that in the exchange phase at least one processing unit executes a send instruction from its local program to transmit at a transmit time a data packet onto its output set of connection wires, the data packet being destined for at least one recipient processing unit but having no destination identifier, and at a predetermined switch time the recipient processing unit executes a switch control instruction from its local program to control its switching circuitry to connect its input set of wires to the switching fabric to receive the data packet at a receive time, the transmit time, switch time and receive time being governed by the common clock with respect to the synchronisation signal.
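
    A toy cycle-level sketch of the destination-less exchange, with an invented fabric latency: the sender puts a packet on its output wires at a compiled transmit cycle, and the recipient's switch control instruction points its input multiplexer at those wires for the matching cycle, so delivery is fixed by timing under the common clock rather than by an address carried in the packet.

        FABRIC_LATENCY = 3                               # hypothetical fixed hop delay

        fabric = {}                                      # cycle -> (source wires, packet)

        def send(source, packet, transmit_cycle):
            # The packet carries no destination identifier; only its timing matters.
            fabric[transmit_cycle + FABRIC_LATENCY] = (source, packet)

        def switch_and_receive(selected_source, switch_cycle):
            # The recipient connects its input wires to the chosen source only at
            # the compiled switch cycle, picking up whatever arrives then.
            source, packet = fabric.get(switch_cycle, (None, None))
            return packet if source == selected_source else None

        send(source="tile7", packet=0xCAFE, transmit_cycle=100)
        assert switch_and_receive(selected_source="tile7", switch_cycle=103) == 0xCAFE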

    Compiler method
    Invention Grant

    Publication Number: US10802536B2

    Publication Date: 2020-10-13

    Application Number: US15886053

    Application Date: 2018-02-01

    Abstract: The invention relates to a computer implemented method of generating multiple programs to deliver a computerised function, each program to be executed in a processing unit of a computer comprising a plurality of processing units each having instruction storage for holding a local program, an execution unit for executing the local program and data storage for holding data, a switching fabric connected to an output interface of each processing unit and connectable to an input interface of each processing unit by switching circuitry controllable by each processing unit, and a synchronisation module operable to generate a synchronisation signal, the method comprising: generating a local program for each processing unit comprising a sequence of executable instructions; determining for each processing unit a relative time of execution of instructions of each local program, whereby a local program allocated to one processing unit is scheduled to execute, with a predetermined delay relative to a synchronisation signal, a send instruction to transmit at least one data packet at a predetermined transmit time relative to the synchronisation signal, the data packet being destined for a recipient processing unit but having no destination identifier, and a local program allocated to the recipient processing unit is scheduled to execute, at a predetermined switch time, a switch control instruction to control the switching circuitry to connect its processing unit wire to the switching fabric to receive the data packet at a receive time.
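
    A small sketch of the scheduling step, under assumed latencies: given the sender's transmit offset relative to the synchronisation signal and the known wire and fabric delays, the compiler can derive the cycle at which the recipient's switch control instruction must execute. All numbers and the one-cycle mux setup margin are invented.

        def schedule_exchange(transmit_offset, sender_to_fabric, fabric_to_receiver):
            """Return (switch_offset, receive_offset) relative to the sync signal."""
            arrival_at_fabric = transmit_offset + sender_to_fabric
            receive_offset = arrival_at_fabric + fabric_to_receiver
            switch_offset = receive_offset - 1           # set the input mux one cycle early
            return switch_offset, receive_offset

        switch_at, receive_at = schedule_exchange(transmit_offset=10,
                                                  sender_to_fabric=4,
                                                  fabric_to_receiver=2)
        # The compiler would emit the send instruction at offset 10 in the sender's
        # local program and the switch control instruction at switch_at in the
        # recipient's local program, so the packet is picked up at receive_at.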

    Exchange of data between processor modules

    Publication Number: US10705998B1

    Publication Date: 2020-07-07

    Application Number: US16276926

    Application Date: 2019-02-15

    Abstract: A processing system comprising: multiple processor modules, each comprising a respective execution unit and memory; and an interconnect for exchanging data between different sets of the processor modules. A group of the processor modules operates in a series of BSP supersteps. For the exchange phase of each superstep, each receiving processor module that is to receive data from outside its own set is pre-programmed with a value representing the number of units of data to receive. Starting from the pre-programmed value, it then counts out the number of data units remaining to be received each time a data unit is received. Each receiving processor module is further arranged to perform an exchange synchronization whereby, before advancing from the exchange phase to the compute phase of the current superstep, the receiving processor module waits until no units of data remain to be received according to the count.
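
    A sketch of the pre-programmed count mechanism, with invented names: each receiving module starts the exchange phase holding the compiled number of expected data units, counts down as units arrive, and passes the exchange synchronization (allowing it to advance to the compute phase) only when the count reaches zero.

        class ReceivingModule:
            def __init__(self, expected_units):
                self.remaining = expected_units          # value written before the superstep
                self.received = []

            def on_data_unit(self, unit):
                self.received.append(unit)
                self.remaining -= 1                      # count out units still to come

            def exchange_sync_passed(self):
                return self.remaining == 0               # may advance to the compute phase

        rx = ReceivingModule(expected_units=2)
        rx.on_data_unit("unit-0")
        assert not rx.exchange_sync_passed()             # still waiting for one more unit
        rx.on_data_unit("unit-1")
        assert rx.exchange_sync_passed()                 # exchange phase may now complete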

    GATEWAY FABRIC PORTS
    Invention Application

    Publication Number: US20200014560A1

    Publication Date: 2020-01-09

    Application Number: US16235256

    Application Date: 2018-12-28

    Abstract: A gateway for interfacing a host with a subsystem for acting as a work accelerator to the host. The gateway enables the transfer of batches of data to the subsystem at precompiled data exchange synchronisation points. The gateway acts to route data between accelerators which are connected in a scaled system of multiple gateways and accelerators using a global address space set up at compile time of an application to run on the computer system.
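
    A sketch of routing over a compile-time global address space, with invented address ranges and port names: each gateway holds a table mapping address ranges either to a locally attached accelerator or to a fabric port leading toward another gateway, and forwards traffic by range lookup.

        ROUTING_TABLE = [
            (range(0x0000, 0x4000), "local_accelerator_0"),
            (range(0x4000, 0x8000), "local_accelerator_1"),
            (range(0x8000, 0x10000), "fabric_port_to_gateway_1"),
        ]

        def route(global_address):
            # Resolve a global address to an attached accelerator or a fabric port.
            for addr_range, destination in ROUTING_TABLE:
                if global_address in addr_range:
                    return destination
            raise ValueError("address not mapped in the global address space")

        assert route(0x0123) == "local_accelerator_0"
        assert route(0x9000) == "fabric_port_to_gateway_1"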

    STREAMING ENGINE
    Invention Application (Pending - Published)

    Publication Number: US20200012534A1

    Publication Date: 2020-01-09

    Application Number: US16235515

    Application Date: 2018-12-28

    Abstract: A gateway for interfacing a host with a subsystem for acting as a work accelerator to the host. The gateway enables the transfer of batches of data to the subsystem at precompiled data exchange synchronisation points. The gateway comprises a streaming engine having a data mover engine and a memory management engine, the data mover engine and memory management engine being configured to execute, in coordination, instructions from work descriptors. The memory management engine is configured to execute instructions from the work descriptor to transfer data between external storage and the local memory associated with the gateway. The data mover engine is configured to execute instructions from the work descriptor to transfer data between the local memory associated with the gateway and the subsystem.
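
    A sketch of a work descriptor driving the two cooperating engines, with invented instruction and memory names: the memory management engine stages a batch from external storage into the gateway's local memory, and the data mover engine streams it from local memory to the accelerator subsystem.

        external_storage = {"batch0": [1, 2, 3]}
        gateway_memory = {}
        subsystem_memory = {}

        def memory_management_engine(instr):
            # e.g. ("load", key): pull a batch from external storage into local memory.
            op, key = instr
            if op == "load":
                gateway_memory[key] = external_storage[key]

        def data_mover_engine(instr):
            # e.g. ("push", key): move a staged batch from local memory to the subsystem.
            op, key = instr
            if op == "push":
                subsystem_memory[key] = gateway_memory[key]

        work_descriptor = {
            "memory_management": [("load", "batch0")],
            "data_mover": [("push", "batch0")],
        }

        # The two engines execute their instruction streams in coordination for the
        # data exchange synchronisation point this descriptor belongs to.
        for instr in work_descriptor["memory_management"]:
            memory_management_engine(instr)
        for instr in work_descriptor["data_mover"]:
            data_mover_engine(instr)
        assert subsystem_memory["batch0"] == [1, 2, 3]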
