Method and apparatus for optimizing advanced encryption standard (aes) encryption and decryption in parallel mode of operation
    1.
    发明专利
    Method and apparatus for optimizing advanced encryption standard (aes) encryption and decryption in parallel mode of operation 有权
    用于优化并行运行模式下的高级加密标准(AES)加密和分解的方法和装置

    公开(公告)号:JP2009205161A

    公开(公告)日:2009-09-10

    申请号:JP2009043695

    申请日:2009-02-26

    CPC classification number: H04L9/0631 G06F9/30007 H04L2209/125

    Abstract: PROBLEM TO BE SOLVED: To optimize advanced encryption standard (AES) encryption and decryption in parallel modes of operation.
    SOLUTION: The throughput of an encryption/decryption operation is increased in a system having a pipelined execution unit. Different independent encryptions (decryptions) of different data blocks may be performed in parallel by despatching an AES round instruction in each cycle.
    COPYRIGHT: (C)2009,JPO&INPIT

    Abstract translation: 要解决的问题:以并行运行模式优化先进的加密标准(AES)加密和解密。 解决方案:在具有流水线执行单元的系统中,加密/解密操作的吞吐量增加。 可以通过在每个周期中发送AES循环指令来并行地执行不同数据块的不同的独立加密(解密)。 版权所有(C)2009,JPO&INPIT

    Efficient parallel floating point exception processing by processor
    2.
    发明专利
    Efficient parallel floating point exception processing by processor 审中-公开
    加工者有效的平行浮点处理

    公开(公告)号:JP2014002767A

    公开(公告)日:2014-01-09

    申请号:JP2013161489

    申请日:2013-08-02

    CPC classification number: G06F9/3861 G06F9/30014 G06F9/30036

    Abstract: PROBLEM TO BE SOLVED: To efficiently process a floating point exception by a processor which executes an SIMD instruction.SOLUTION: Processing includes the steps of: specifying a numeral exception of an SIMD floating point operation; starting a first SIMD micro operation so as to generate a first Packed part result of the SIMD floating point operation; starting a second SIMD micro operation so as to generate a second Packed part result of the SIMD floating point operation; starting an SIMD unnormalization micro operation so as to put the first and second Packed part results together and to generate a third Packed result having an unnormalized element by unnormalizing a first element of the first and second Packed part results having been put together; storing the third Packed result of the SIMD floating point operation; and setting a flag for specifying the unnormalized element of the third Packed result to the first Packed part result.

    Abstract translation: 要解决的问题:通过执行SIMD指令的处理器有效地处理浮点异常。解决方案:处理包括以下步骤:指定SIMD浮点运算的数字异常; 开始第一SIMD微操作,以产生SIMD浮点运算的第一打包部分结果; 开始第二SIMD微操作,以产生SIMD浮点运算的第二打包部分结果; 开始SIMD非归一化微操作,以将第一和第二填充部分结果合并在一起,并通过将已经放在一起的第一和第二填充零件结果的第一元素非规范化来生成具有非归一化元素的第三填充结果; 存储SIMD浮点运算的第三个打包结果; 并且将用于将第三打包结果的非标准化元素指定为第一包装部分结果的标志。

    Efficient parallel floating-point exception handling in processor
    3.
    发明专利
    Efficient parallel floating-point exception handling in processor 审中-公开
    处理器中有效的平行浮点处理处理

    公开(公告)号:JP2010020761A

    公开(公告)日:2010-01-28

    申请号:JP2009152985

    申请日:2009-06-26

    CPC classification number: G06F9/3861 G06F9/30014 G06F9/30036

    Abstract: PROBLEM TO BE SOLVED: To efficiently handle floating-point exceptions in a processor that executes single-instruction multiple-data (SIMD) instructions. SOLUTION: A method is provided for handling the floating-point exceptions in the processor that executes the SIMD instructions. The method includes: a step of initiating a first SIMD micro-operation to generate first packed partial results of the SIMD operation; a step of initiating a second SIMD micro-operation to generate second packed partial results of the SIMD operation; a step of initiating a SIMD denormalization micro-operation to combine the first and the second packed partial results and to generate third packed results having denormalized elements by denormalizing a first element of first and second packed partial results; a step of storing third packed results of the SIMD operation; and a step of setting a flag to identify denormalized elements of the third packed results in the first packed partial results. COPYRIGHT: (C)2010,JPO&INPIT

    Abstract translation: 要解决的问题:有效处理执行单指令多数据(SIMD)指令的处理器中的浮点异常。 解决方案:提供了一种处理执行SIMD指令的处理器中的浮点异常的方法。 该方法包括:启动第一SIMD微操作以产生SIMD操作的第一打包部分结果的步骤; 启动第二SIMD微操作以产生SIMD操作的第二打包部分结果的步骤; 启动SIMD非规范化微操作以组合第一和第二打包部分结果并通过对第一和第二打包部分结果的第一元素进行非归一化来生成具有非归一化元素的第三打包结果的步骤; 存储SIMD操作的第三打包结果的步骤; 以及设置标志以识别第一打包部分结果中的第三打包结果的非规范化元素的步骤。 版权所有(C)2010,JPO&INPIT

    Efficient parallel floating point exception handling in processor
    4.
    发明专利
    Efficient parallel floating point exception handling in processor 审中-公开
    处理器中有效的平行浮点处理处理

    公开(公告)号:JP2013050965A

    公开(公告)日:2013-03-14

    申请号:JP2012222612

    申请日:2012-10-05

    CPC classification number: G06F9/3861 G06F9/30014 G06F9/30036

    Abstract: PROBLEM TO BE SOLVED: To efficiently handle floating point exceptions in a processor that executes SIMD instructions.SOLUTION: The method comprising: identifying a numerical exception for a SIMD floating point operation; initiating a first SIMD micro-operation to generate a first packed partial result for the SIMD floating point operation; initiating a second SIMD micro-operation to generate a second packed partial result for the SIMD floating point operation; initiating a SIMD denormalization micro-operation to combine the first and second packed partial results and to denormalize a first element of the combined first and second packed partial results to generate a third packed result having a denormal element; storing the third packed result for the SIMD floating point operation; and setting a flag identifying the denormal element of the third packed result in the first packed partial result.

    Abstract translation: 要解决的问题:有效处理执行SIMD指令的处理器中的浮点异常。 解决方案:该方法包括:识别SIMD浮点运算的数字异常; 启动第一SIMD微操作以产生用于SIMD浮点运算的第一打包部分结果; 启动第二SIMD微操作以产生用于SIMD浮点运算的第二打包部分结果; 启动SIMD非规范化微操作以组合所述第一和第二打包部分结果,并且对所述组合的第一和第二打包部分结果的第一元素进行非规范化,以生成具有反正态元素的第三打包结果; 存储用于SIMD浮点运算的第三打包结果; 以及在第一打包部分结果中设置标识第三打包结果的异常元素的标志。 版权所有(C)2013,JPO&INPIT

    8.
    发明专利
    未知

    公开(公告)号:ES3013609T3

    公开(公告)日:2025-04-14

    申请号:ES20216494

    申请日:2019-10-08

    Applicant: INTEL CORP

    Abstract: Las realizaciones descritas se refieren a un procesador, un sistema en un chip y un sistema para ejecutar una instrucción de conversión de formato. En un ejemplo, un procesador que tiene una pluralidad de núcleos, incluido un núcleo que, en respuesta a una instrucción de conversión de formato que tiene un primer operando de origen que incluye un primer elemento de datos de punto flotante de precisión simple de 32 bits y un segundo operando de origen que incluye un segundo elemento de datos de punto flotante de precisión simple de 32 bits, debe: convertir el primer elemento de datos de punto flotante de precisión simple de 32 bits en un primer elemento de datos de punto flotante de 16 bits, en donde, cuando el primer elemento de datos de punto flotante de precisión simple de 32 bits es un elemento de datos normal, la conversión se debe realizar de acuerdo con un modo de redondeo especificado por la instrucción de conversión de formato, y el primer elemento de datos de punto flotante de 16 bits debe tener un bit de signo, un exponente de 8 bits, siete bits de mantisa explícitos y un bit de mantisa implícito, y en donde, cuando el primer elemento de datos de punto flotante de precisión simple de 32 bits es un elemento de datos que no es un número, NaN, el primer elemento de datos de punto flotante de 16 bits debe tener una mantisa con un máximo bit significativo establecido en uno; convertir el segundo elemento de datos de punto flotante de precisión simple de 32 bits en un segundo elemento de datos de punto flotante de 16 bits, donde, cuando el segundo elemento de datos de punto flotante de precisión simple de 32 bits es un elemento de datos normal, la conversión se debe realizar de acuerdo con el modo de redondeo, y el segundo elemento de datos de punto flotante de 16 bits debe tener un bit de signo, un exponente de 8 bits, siete bits de mantisa explícitos y un bit de mantisa implícito, y donde cuando el segundo elemento de datos de punto flotante de precisión simple de 32 bits es un elemento de datos NaN, el segundo elemento de datos de punto flotante de 16 bits debe tener una mantisa con un bit más significativo establecido en uno; y almacenar el primer elemento de datos de punto flotante de 16 bits en una mitad de orden inferior de un registro de destino y el segundo elemento de datos de punto flotante de 16 bits en una mitad de orden superior del registro de destino. (Traducción automática con Google Translate, sin valor legal)

    SYSTEMS AND METHODS FOR PERFORMING INSTRUCTIONS TO CONVERT TO 16-BIT FLOATING-POINT FORMAT

    公开(公告)号:FI3822774T3

    公开(公告)日:2025-02-27

    申请号:FI20216494

    申请日:2019-10-08

    Applicant: INTEL CORP

    Abstract: Disclosed embodiments relate to a processor and a method for executing a format conversion instruction. In one example, a processor comprises a decode unit to decode the format conversion instruction and an execution unit to execute the decoded format conversion instruction. The format conversion instruction indicates a location of a first source operand, a location of a second source operand, a destination register, a writemask register, and a type of masking, the first source operand to include a first plurality of 32-bit single-precision floating point data elements, the second source operand to include a second plurality of 32-bit single-precision floating point data elements, the writemask register to store a plurality of mask bits each corresponding to a data element position in the destination register, the type of masking to be either zeroing masking or merging masking.

Patent Agency Ranking