Invention Grant
- Patent Title: Apparatus and method for vector multiply and accumulate of packed bytes
-
Application No.: US15879419Application Date: 2018-01-24
-
Publication No.: US11768681B2Publication Date: 2023-09-26
- Inventor: Alexander Heinecke , Dipankar Das , Robert Valentine , Mark Charney
- Applicant: Intel Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Intel Corporation
- Current Assignee: Intel Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: NICHOLSON DE VOS WEBSTER & ELLIOTT LLP
- Main IPC: G06F9/30
- IPC: G06F9/30 ; G06F9/38

Abstract:
An apparatus and method for performing multiply-accumulate operations. For example, one embodiment of a processor comprises: a decoder to decode instructions; a first source register to store a first plurality of packed bytes; a second source register to store a second plurality of packed bytes; a third source register to store a plurality of packed doublewords; execution circuitry to execute a first instruction, the execution circuitry comprising: extension circuitry to sign-extend or zero-extend the first and second plurality of packed bytes to generate a first and second plurality of words corresponding to the first and second plurality of packed bytes; multiplier circuitry to multiply each of the first plurality of words with a corresponding one of the second plurality of words to generate a plurality of temporary products; adder circuitry to add at least a first set of the temporary products to generate a first temporary sum; accumulation circuitry to combine the first temporary sum with a first packed doubleword value from a first doubleword location in the third source register to generate a first accumulated doubleword result; a destination register to store the first accumulated doubleword result in the first doubleword location.
Public/Granted literature
- US20190042236A1 APPARATUS AND METHOD FOR VECTOR MULTIPLY AND ACCUMULATE OF PACKED BYTES Public/Granted day:2019-02-07
Information query