Patent search ap:("INTEL CORPORATION") AND inv:"Supratim Pal" Page 5

41.

发明授权
Use of a single instruction set architecture (ISA) instruction for vector normalization 有权

公开(公告)号：US11157238B2

公开(公告)日：2021-10-26

申请号：US16685561

申请日：2019-11-15

Applicant: Intel Corporation

Inventor： Abhishek Rhisheekesan , Supratim Pal , Shashank Lakshminarayana , Subramaniam Maiyuran

IPC: G06F7/552 , G06F9/30

Abstract: Embodiments described herein are generally directed to an improved vector normalization instruction. An embodiment of a method includes responsive to receipt by a GPU of a single instruction specifying a vector normalization operation to be performed on V vectors: (i) generating V squared length values, N at a time, by a first processing unit, by, for each N sets of inputs, each representing multiple component vectors for N of the vectors, performing N parallel dot product operations on the N sets of inputs. Generating V sets of outputs representing multiple normalized component vectors of the V vectors, N at a time, by a second processing unit, by, for each N squared length values of the V squared length values, performing N parallel operations on the N squared length values, wherein each of the N parallel operations implement a combination of a reciprocal square root function and a vector scaling function.

42.

发明申请
SPARSE MATRIX OPTIMIZATION MECHANISM 有权

公开(公告)号：US20210183002A1

公开(公告)日：2021-06-17

申请号：US16710081

申请日：2019-12-11

Applicant: Intel Corporation

Inventor： Namita Sharma , Supratim Pal , Biju P. Simon , Tovinakere D. Vivek

IPC: G06T1/20

Abstract: An apparatus to facilitate matrix processing is disclosed. The apparatus comprises a matrix accelerator to receive input matrix data, transform the input matrix data into a plurality of sub-blocks, examine a first block of the sub-blocks to determine whether the first block comprises sparse data, select a first tile size upon a determination that the first block comprises sparse data and generate output matrix data based on the first tile size.

43.

发明授权
Control flow mechanism for execution of graphics processor instructions using active channel packing 有权

公开(公告)号：US10990409B2

公开(公告)日：2021-04-27

申请号：US15493442

申请日：2017-04-21

Applicant: Intel Corporation

Inventor： Subramaniam M. Maiyuran , Guei-Yuan Lueh , Supratim Pal , Gang Chen , Ananda V. Kommaraju , Joy Chandra , Altug Koker , Prasoonkumar Surti , David Puffer , Hong Bin Liao , Joydeep Ray , Abhishek R. Appu , Ankur N. Shah , Travis T. Schluessler , Jonathan Kennedy , Devan Burke

IPC: G06F9/38 , G06F9/30 , G06T1/20

Abstract: An apparatus to facilitate control flow in a graphics processing system is disclosed. The apparatus includes logic a plurality of execution units to execute single instruction, multiple data (SIMD) and flow control logic to detect a diverging control flow in a plurality of SIMD channels and reduce the execution of the control flow to a subset of the SIMD channels.

44.

发明申请
ACCUMULATOR POOLING MECHANISM 审中-公开

公开(公告)号：US20200320662A1

公开(公告)日：2020-10-08

申请号：US16378047

申请日：2019-04-08

Applicant: Intel Corporation

Inventor： Guei-Yuan Lueh , Subramaniam Maiyuran , Wei-Yu Chen , Konrad Trifunovic , Supratim Pal , Chandra S. Gurram , Jorge E. Parra , Pratik J. Ashar , Tomasz Bujewski

IPC: G06T1/60 , G06F9/30 , G06T1/20

Abstract: A processor is disclosed. The processor includes an execution unit having a register file having one or more banks of registers to store operand values, an accumulator comprising a pool of registers to store operand values determined to cause a conflict at register banks within the register file and cache circuitry to control storage of the operand values determined to cause a conflict at the register banks from the register file to the pool of registers.

45.

发明申请
HIERARCHICAL GENERAL REGISTER FILE (GRF) FOR EXECUTION BLOCK 审中-公开

公开(公告)号：US20200026514A1

公开(公告)日：2020-01-23

申请号：US16526147

申请日：2019-07-30

Applicant: Intel Corporation

Inventor： Abhishek R. Appu , Altug Koker , Joydeep Ray , Kamal Sinha , Kiran C. Veernapu , Subramaniam Maiyuran , Prasoonkumar Surti , Guei-Yuan Lueh , David Puffer , Supratim Pal , Eric J. Hoekstra , Travis T. Schluessler , Linda L. Hurd

IPC: G06F9/30 , G06F9/38 , G06T15/00 , G06T1/60 , G06T1/20 , G06F9/46 , G09G5/36

Abstract: In an example, an apparatus comprises a plurality of execution units, and a first general register file (GRF) communicatively couple to the plurality of execution units, wherein the first GRF is shared by the plurality of execution units. Other embodiments are also disclosed and claimed.

46.

发明授权
Instruction that performs a scatter write 有权

公开(公告)号：US09880839B2

公开(公告)日：2018-01-30

申请号：US14261097

申请日：2014-04-24

Applicant: Intel Corporation

Inventor： Wei-Yu Chen , Guei-Yuan Lueh , Subramaniam Maiyuran , Supratim Pal

IPC: G06F9/30

CPC classification number: G06F9/30032 , G06F9/30036 , G06F9/30043

Abstract: A processor is described having an instruction execution pipeline. The instruction execution pipeline has an instruction fetch stage to fetch an instruction specifying multiple target resultant registers. The instruction execution pipeline has an instruction decode stage to decode the instruction. The instruction execution pipeline has a functional unit to prepare resultant content specific to each of the multiple target resultant registers. The instruction execution pipeline has a write-back stage to write back said resultant content specific to each of said multiple target resultant registers.

47.

发明申请
DYNAMIC THREAD SPLITTING 审中-公开
Title translation: 动态螺纹分割

公开(公告)号：US20170010894A1

公开(公告)日：2017-01-12

申请号：US14794521

申请日：2015-07-08

Applicant: Intel Corporation

Inventor： Hema C. Nalluri , Supratim Pal , Subramaniam Maiyuran , Joy Chandra

IPC: G06F9/30

Abstract: Systems, apparatuses and methods may provide for associating a first instruction pointer with an IF block of a primary IF-ELSE conditional construct associated with a thread and activating a second instruction pointer in response to a dependency associated with the IF block. Additionally, the second instruction pointer may be associated with an ELSE block of the primary IF-ELSE conditional construct. In one example, the IF block and the ELSE block are executed, via the first instruction pointer and the second instruction pointer, one or more of independently from or parallel to one another.

Abstract translation: 系统，装置和方法可以提供将第一指令指针与与线程相关联的主IF-ELSE条件结构的IF块相关联，并且响应于与IF块相关联的依赖关系来激活第二指令指针。另外，第二指令指针可以与主IF-ELSE条件构造的ELSE块相关联。在一个示例中，IF块和ELSE块经由第一指令指针和第二指令指针执行独立于或彼此平行的一个或多个。

48.

发明申请
SYSTOLIC ARRAY OF ARBITRARY PHYSICAL AND LOGICAL DEPTH 有权

公开(公告)号：US20250117360A1

公开(公告)日：2025-04-10

申请号：US18931412

申请日：2024-10-30

Applicant: Intel Corporation

Inventor： Jorge Parra , Wei-yu Chen , Kaiyu Chen , Varghese George , Junjie Gu , Chandra Gurram , Guei-Yuan Lueh , Stephen Junkins , Subramaniam Maiyuran , Supratim Pal

IPC: G06F15/80 , G06F9/30

Abstract: A processing apparatus includes a processing resource including a general-purpose parallel processing engine and a matrix accelerator. The matrix accelerator includes first circuitry to receive a command to perform operations associated with an instruction, second circuitry to configure the matrix accelerator according to a physical depth of a systolic array within the matrix accelerator and a logical depth associated with the instruction, third circuitry to read operands for the instruction from a register file associated with the systolic array, fourth circuitry to perform operations for the instruction via one or more passes through one or more physical pipeline stages of the systolic array based on a configuration performed by the second circuitry, and fifth circuitry to write output of the operations to the register file associated with the systolic array.

49.

发明申请
DUAL PIPELINE PARALLEL SYSTOLIC ARRAY 有权

公开(公告)号：US20250117359A1

公开(公告)日：2025-04-10

申请号：US18913758

申请日：2024-10-11

Applicant: Intel Corporation

Inventor： Jorge Parra , Jiasheng Chen , Supratim Pal , Fangwen Fu , Sabareesh Ganapathy , Chandra Gurram , Chunhui Mei , Yue Qi

IPC: G06F9/38 , G06F9/30

Abstract: A processing apparatus described herein includes a general-purpose parallel processing engine comprising a systolic array having multiple pipelines, each of the multiple pipelines including multiple pipeline stages, wherein the multiple pipelines include a first pipeline, a second pipeline, and a common input shared between the first pipeline and the second pipeline.

50.

发明申请
AVOIDING THE USE OF A RESULT CROSSBAR WHEN DOWN CONVERTING TO PACKED REGISTER FORMATS 有权

公开(公告)号：US20250036412A1

公开(公告)日：2025-01-30

申请号：US18358308

申请日：2023-07-25

Applicant: Intel Corporation

Inventor： Supratim Pal , Jiasheng Chen , Christopher Spencer , Jorge E. Parra Osorio , Kevin Hurd , Guei-Yuan Lueh , Pradeep K. Golconda , Fangwen Fu , Wei Xiong , Hongzheng Li , James Valerio , Mukundan Swaminathan , Nicholas Murphy , Shuai Mu , Clifford Gibson , Buqi Cheng

IPC: G06F9/30 , G06F9/38

Abstract: Described herein is a graphics processor comprising a memory interface and a graphics processing cluster coupled with the memory interface. The graphics processing cluster includes a plurality of processing resources. A processing resource of the plurality of processing resources includes a source crossbar communicatively coupled with a register file, the source crossbar to reorder data elements of a source operand and a format conversion pipeline to convert a plurality of input data elements specified by the source operand from a first format of a plurality of datatype formats to a second format of the plurality of datatype formats, the plurality of datatype formats including integer and floating-point formats.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification