Abstract:
A single macroinstruction is received, which specifies at least two logical registers. These registers each store first and second operands in packed data form with data elements in mutual correspondence. An operation specified from the individual macroinstructions, is executed on numbers of first and second mutually-corresponding data elements of the first and second operands, at different times, using the same circuit to produce independently, numbers of first and second resulting data elements. These are stored in a single logical register as third packed data operands. An Independent claim is included for the corresponding processor and system.
Abstract:
A method and apparatus are disclosed for staggering execution of an instruction. According to one embodiment of the invention, a single macro instruction is received wherein the single macro instruction specifies at least two logical registers and wherein the two logical registers respectively store a first and second packed data operands having corresponding data elements. An operation specified by the single macro instruction is then performed independently on a first and second plurality of the corresponding data elements from said first and second packed data operands at different times using the same circuit to independently generate a first and second plurality of resulting data elements. The first and second plurality of resulting data elements are stored in a single logical register as a third packed data operand.
Abstract:
A method and apparatus are disclosed for staggering execution of an instruction. According to one embodiment of the invention, a single macro instruction is received wherein the single macro instruction specifies at least two logical registers and wherein the two logical registers respectively store a first and second packed data operands having corresponding data elements. An operation specified by the single macro instruction is then performed independently on a first and second plurality of the corresponding data elements from said first and second packed data operands at different times using the same circuit to independently generate a first and second plurality of resulting data elements. The first and second plurality of resulting data elements are stored in a single logical register as a third packed data operand.