Patent search aee:"MicroUnity Systems Engineering Page Inc."

81.

发明申请
Processor architecture with wide operand cache 有权
Title translation: 具有宽操作数缓存的处理器架构

公开(公告)号：US20090100227A1

公开(公告)日：2009-04-16

申请号：US11982051

申请日：2007-10-31

Applicant: Craig Hansen , John Moussouris , Alexia Massalin

Inventor： Craig Hansen , John Moussouris , Alexia Massalin

IPC: G06F12/00

CPC classification number: G06F9/3001 , G03F1/36 , G06F9/30 , G06F9/30007 , G06F9/30014 , G06F9/30018 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30098 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/30145 , G06F9/30149 , G06F9/3016 , G06F9/30167 , G06F9/35 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3885 , G06F9/4484 , G06F9/45533 , G06F12/02 , G06F17/5068 , G06F17/5072 , G06F17/5081 , G06F2217/12 , H03M13/158 , H03M13/4169 , Y02D10/13 , Y02D10/26 , Y02D10/28 , Y02P90/265

Abstract: A programmable processor and method for improving the performance of processors by expanding at least two source operands, or a source and a result operand, to a width greater than the width of either the general purpose register or the data path width. The present invention provides operands which are substantially larger than the data path width of the processor by using the contents of a general purpose register to specify a memory address at which a plurality of data path widths of data can be read or written, as well as the size and shape of the operand. In addition, several instructions and apparatus for implementing these instructions are described which obtain performance advantages if the operands are not limited to the width and accessible number of general purpose registers.

Abstract translation: 一种可编程处理器和方法，用于通过将至少两个源操作数或源和结果操作数扩展到大于通用寄存器或数据路径宽度的宽度的宽度来提高处理器的性能。本发明通过使用通用寄存器的内容来指定可以读取或写入数据的多个数据路径宽度的存储器地址，并且基本上大于处理器的数据路径宽度的操作数，以及操作数的大小和形状。此外，描述了用于实现这些指令的几个指令和装置，其如果操作数不限于通用寄存器的宽度和可访问数量，则获得性能优点。

82.

发明申请
Processor architecture for executing transfers between wide operand memories 审中-公开
Title translation: 用于执行广泛操作数存储器之间传输的处理器架构

公开(公告)号：US20090089540A1

公开(公告)日：2009-04-02

申请号：US11982202

申请日：2007-10-31

Applicant: Craig Hansen , John Moussouris , Alexia Massalin

Inventor： Craig Hansen , John Moussouris , Alexia Massalin

IPC: G06F15/76 , G06F9/02

CPC classification number: G06F9/30014 , G06F9/30007 , G06F9/30021 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/30043 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/32 , G06F9/34 , G06F9/3824 , G06F9/383 , G06F9/3885 , G06F12/0886

Abstract: A programmable processor and method for improving the performance of processors by expanding at least two source operands, or a source and a result operand, to a width greater than the width of either the general purpose register or the data path width. The present invention provides operands which are substantially larger than the data path width of the processor by using the contents of a general purpose register to specify a memory address at which a plurality of data path widths of data can be read or written, as well as the size and shape of the operand. In addition, several instructions and apparatus for implementing these instructions are described which obtain performance advantages if the operands are not limited to the width and accessible number of general purpose registers.

Abstract translation: 一种可编程处理器和方法，用于通过将至少两个源操作数或源和结果操作数扩展到大于通用寄存器或数据路径宽度的宽度的宽度来提高处理器的性能。本发明通过使用通用寄存器的内容来指定可以读取或写入数据的多个数据路径宽度的存储器地址，并且基本上大于处理器的数据路径宽度的操作数，以及操作数的大小和形状。此外，描述了用于实现这些指令的几个指令和装置，其如果操作数不限于通用寄存器的宽度和可访问数量，则获得性能优点。

83.

发明授权
System and method to implement a matrix multiply unit of a broadband processor 失效
Title translation: 实现宽带处理器的矩阵乘法单元的系统和方法

公开(公告)号：US07483935B2

公开(公告)日：2009-01-27

申请号：US10233779

申请日：2002-09-04

Applicant: Craig Hansen , Bruce Bateman , John Moussouris

Inventor： Craig Hansen , Bruce Bateman , John Moussouris

IPC: G06F7/00 , G06F7/52

CPC classification number: G06F17/16 , G06F7/5334 , G06F7/724 , G06F9/3001 , G06F9/30036

Abstract: The present invention provides a system and method for improving the performance of general-purpose processors by implementing a functional unit that computes the product of a matrix operand with a vector operand, producing a vector result. The functional unit fully utilizes the entire resources of a 128b by 128b multiplier regardless of the operand size, as the number of elements of the matrix and vector operands increase as operand size is reduced. The unit performs both fixed-point and floating-point multiplications and additions with the highest-possible intermediate accuracy with modest resources.

Abstract translation: 本发明提供了一种用于通过实现一个功能单元来提高通用处理器的性能的系统和方法，所述功能单元使用向量操作数来计算矩阵操作数的乘积，产生向量结果。功能单元完全利用128b乘128b乘法器的全部资源，无论操作数大小如何，因为矩阵和向量操作数的元素数量随着操作数大小的减小而增加。该单元通过适度的资源执行具有最高可能的中间精度的定点和浮点乘法和补充。

84.

发明授权
Method and software for multithreaded processor with partitioned operations 有权
Title translation: 具有划分操作的多线程处理器的方法和软件

公开(公告)号：US07430655B2

公开(公告)日：2008-09-30

申请号：US10757515

申请日：2004-01-15

Applicant: Craig Hansen , John Moussouris

Inventor： Craig Hansen , John Moussouris

IPC: G06F9/46

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: A system and software for improving the performance of processors by incorporating an execution unit configurable to execute a plurality of instruction streams from the plurality of threads, wherein each instruction stream includes a group instruction that operates on a plurality of data elements in partitioned fields of at least one of the registers to produce a catenated result.

Abstract translation: 一种用于通过结合执行单元来配置以执行来自多个线程的多个指令流的执行单元来提高处理器的性能的系统和软件，其中每个指令流包括对分组字段中的多个数据元素进行操作的组指令至少有一个寄存器产生连带结果。

85.

发明申请
Method and Apparatus for Programmable Processor 审中-公开
Title translation: 可编程处理器的方法和装置

公开(公告)号：US20080072020A1

公开(公告)日：2008-03-20

申请号：US11841964

申请日：2007-08-20

Applicant: Craig Hansen , John Moussouris

Inventor： Craig Hansen , John Moussouris

IPC: G06F9/302

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: Systems and apparatuses are presented relating a programmable processor comprising an execution unit that is operable to decode and execute instructions received from an instruction path and partition data stored in registers in the register file into multiple data elements, the execution unit capable of executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on multiple data elements stored registers in a register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the execution unit is capable of executing group data handling operations that re-arrange data elements in different ways in response to data handling instructions.

Abstract translation: 提出了一种系统和装置，其涉及包括执行单元的可编程处理器，该执行单元可操作以解码和执行从指令路径接收的指令，并将存储在寄存器堆中的寄存器中的数据分割成多个数据元素，该执行单元能够执行多个不同的组浮点和组整数算术运算，每个算术运算在寄存器文件中的多个数据元素存储的寄存器上产生返回到寄存器文件中的寄存器的连接结果，其中，连接结果包括多个单独的结果其中，所述执行单元能够执行响应于数据处理指令以不同方式重新布置数据元素的组数据处理操作。

86.

发明申请
Method and Apparatus for Improved Programmable Processor 审中-公开
Title translation: 改进的可编程处理器的方法和装置

公开(公告)号：US20080059766A1

公开(公告)日：2008-03-06

申请号：US11842006

申请日：2007-10-29

Applicant: Craig Hansen , John Moussouris

Inventor： Craig Hansen , John Moussouris

IPC: G06F9/302 , G06F15/76

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: Systems and apparatuses are presented relating a programmable processor comprising an execution unit that is operable to decode and execute instructions received from an instruction path and partition data stored in registers in the register file into multiple data elements, the execution unit capable of executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on multiple data elements stored registers in a register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the execution unit is capable of executing group data handling operations that re-arrange data elements in different ways in response to data handling instructions.

Abstract translation: 提出了一种系统和装置，其涉及包括执行单元的可编程处理器，该执行单元可操作以解码和执行从指令路径接收的指令，并将存储在寄存器堆中的寄存器中的数据分割成多个数据元素，该执行单元能够执行多个不同的组浮点和组整数算术运算，每个算术运算在寄存器文件中的多个数据元素存储的寄存器上产生返回到寄存器文件中的寄存器的连接结果，其中，连接结果包括多个单独的结果其中，所述执行单元能够执行响应于数据处理指令以不同方式重新布置数据元素的组数据处理操作。

87.

发明授权
Programmable processor and method for matched aligned and unaligned storage instructions 有权
Title translation: 可编程处理器和方法，用于匹配的对齐和未对齐存储指令

公开(公告)号：US07222225B2

公开(公告)日：2007-05-22

申请号：US10716561

申请日：2003-11-20

Applicant: Craig Hansen , John Moussouris

Inventor： Craig Hansen , John Moussouris

IPC: G06F9/34

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: A programmable processor and method for improving the performance of processors by incorporating an execution unit operable to decode and execute single instructions in an instruction set comprising (a) group instructions that operate on a plurality of data elements in partitioned fields of a register to produce a catenated result, (b) aligned memory operations that move data between memory and register where the memory operand is aligned, and (c) unaligned memory operations where the memory operand is unaligned.

Abstract translation: 一种可编程处理器和方法，用于通过结合执行单元来操作以解码和执行指令集中的单个指令来提高处理器的性能，所述指令集包括：（a）对寄存器的分割字段中的多个数据元素进行操作的组指令，连接结果，（b）对齐的存储器操作，其在存储器和寄存器之间移动数据，存储器操作数对齐，以及（c）存储器操作数未对齐的未对齐的存储器操作。

88.

发明申请
Multithreaded programmable processor and system with partitioned operations 有权
Title translation: 多线程可编程处理器和具有分区操作的系统

公开(公告)号：US20040210745A1

公开(公告)日：2004-10-21

申请号：US10757939

申请日：2004-01-16

Applicant: MICROUNITY SYSTEMS ENGINEERING, INC.

Inventor： Craig Hansen , John Moussouris

IPC: G06F009/00

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: A programmable processor and method for improving the performance of processors by incorporating an execution unit configurable to execute a plurality of instruction streams from the plurality of threads, wherein each instruction stream includes a group instruction that operates on a plurality of data elements in partitioned fields of at least one of the registers to produce a catenated result.

Abstract translation: 一种可编程处理器和方法，用于通过结合执行单元来提高处理器的性能，所述执行单元可配置为执行来自所述多个线程的多个指令流，其中每个指令流包括对分割字段中的多个数据元素进行操作的组指令至少有一个寄存器产生连带结果。

89.

发明申请
Method and software for store multiplex operation 失效
Title translation: 存储复用操作的方法和软件

公开(公告)号：US20040205325A1

公开(公告)日：2004-10-14

申请号：US10757866

申请日：2004-01-16

Applicant: MICROUNITY SYSTEMS ENGINEERING, INC.

Inventor： Craig Hansen , John Moussouris

IPC: G06F009/00

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: A system and software for improving the performance of processors by incorporating an execution unit operable to decode and execute single instructions specifying both a mask and a register containing data, the mask comprising fields that each correspond to a field of the data contained in the register, the execution unit is operable to detect some of the fields of the mask as having a predetermined value and identifying corresponding fields of the data contained in the register as write-enabled data fields; and cause the write-enabled data fields to be written to a specified memory location.

Abstract translation: 一种用于通过结合执行单元的系统和软件，所述执行单元可操作以解码和执行指定掩模和包含数据的寄存器的单个指令，所述掩码包括各自对应于包含在所述寄存器中的数据的字段的字段，所述执行单元可操作以将所述掩模的一些场检测为具有预定值，并将包含在所述寄存器中的数据的相应字段识别为可写入数据字段; 并使写入使能的数据字段被写入指定的存储器位置。

90.

发明申请
Method and software for partitioned floating-point multiply-add operation 失效
Title translation: 用于分区浮点乘法运算的方法和软件

公开(公告)号：US20040205324A1

公开(公告)日：2004-10-14

申请号：US10757851

申请日：2004-01-16

Applicant: MICROUNITY SYSTEMS ENGINEERING, INC.

Inventor： Craig Hansen , John Moussouris

IPC: G06F009/44

CPC classification number: G06F9/30014 , G06F9/30018 , G06F9/30025 , G06F9/30029 , G06F9/30032 , G06F9/30036 , G06F9/3004 , G06F9/30043 , G06F9/30054 , G06F9/30087 , G06F9/30101 , G06F9/30109 , G06F9/30112 , G06F9/3012 , G06F9/30123 , G06F9/30145 , G06F9/3016 , G06F9/30167 , G06F9/3816 , G06F9/3824 , G06F9/383 , G06F9/3851 , G06F9/3861 , G06F9/3873 , G06F9/3885 , G06F15/7832

Abstract: A method and software for improving the performance of processors by incorporating an execution unit operable to decode and execute single instructions specifying three registers each containing a plurality of data elements, the execution unit operable to multiply the first and second registers and add the third register to produce a catenated result containing a plurality of data elements. Additional instructions provide group floating-point subtract, add, multiply, set less, and set greater equal operations. The set less and set greater equal operations produce alternatively zero or an identity element for each element of a catenated result, the result facilitating alternative selection of individual data elements using bitwise Boolean operations and without requiring conditional branch operations.

Abstract translation: 一种用于通过并入执行单元的方法和软件，该执行单元可操作以解码和执行指定三个寄存器的单个指令，每个寄存器包含多个数据元素，所述执行单元可操作以将第一和第二寄存器相乘并将第三寄存器添加到产生包含多个数据元素的连接结果。附加指令提供组浮点减法，加，乘，少设置和设置更大的相等操作。集合较少和设置较大相等的操作为连接结果的每个元素产生替代零或身份元素，结果便于使用逐位布尔运算而不需要条件分支操作来替代选择各个数据元素。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification