Abstract:
A vector functional unit implemented on a semiconductor chip to perform vector operations of dimension N is described. The vector functional unit includes N functional units. Each of the N functional units have logic circuitry to perform: a first integer multiply add instruction that presents highest ordered bits but not lowest ordered bits of a first integer multiply add calculation, and, a second integer multiply add instruction that presents lowest ordered bits but not highest ordered bits of a second integer multiply add calculation.
Abstract:
A method of performing vector operations on a semiconductor chip is described. The method includes performing a first vector instruction with a vector functional unit implemented on the semiconductor chip and performing a second vector instruction with the vector functional unit. The first vector instruction is a vector multiply add instruction. The second vector instruction is a vector leading zeros count instruction.
Abstract:
A semiconductor processor is described. The semiconductor processor includes logic circuitry to perform a logical reduction instruction. The logic circuitry has swizzle circuitry to swizzle a vector's elements so as to form a swizzle vector. The logic circuitry also has vector logic circuitry to perform a vector logic operation on said vector and said swizzle vector.
Abstract:
A vector functional unit implemented on a semiconductor chip to perform vector operations of dimension N is described. The vector functional unit includes N functional units. Each of the N functional units have logic circuitry to perform: a first integer multiply add instruction that presents highest ordered bits but not lowest ordered bits of a first integer multiply add calculation, and, a second integer multiply add instruction that presents lowest ordered bits but not highest ordered bits of a second integer multiply add calculation.
Abstract:
Technologien für Unterbrechungs-disassoziierte Warteschlangenbildung für Multi-Warteschlangen-Eingabe-/Ausgabe-Vorrichtungen umfassen ein Bestimmen, ob ein Netzwerkpaket in einer Unterbrechungs-disassoziierten Warteschlange angekommen ist, und ein Liefern des Netzwerkpakets an eine Anwendung, die durch den Rechenknoten gemanagt wird. Die Anwendung ist mit einem Anwendungs-Thread assoziiert, und die Unterbrechungs-disassoziierte Warteschlange kann in einem Polling-Modus sein. Nachfolgend kann, ansprechend auf ein Übergangsereignis, die Unterbrechungs-disassoziierte Warteschlange in einen Unterbrechungsmodus übergehen.
Abstract:
Es werden Ausführungsformen von Systemen, Vorrichtungen und Verfahren zum Ausführen einer Expandier- und/oder Komprimieranweisung in einem Computerprozessor beschrieben. Bei bestimmten Ausführungsformen bewirkt Die Ausführung einer Expandieranweisung die Auswahl von Elementen aus einer Quelle, die spärlich in einem Ziel zu speichern sind, auf der Basis von Werten der Schreibmaske, und Speichern jedes ausgewählten Datenelements der Quelle als spärliches Datenelement in einer Zielspeicherstelle, wobei die Zielspeicherstellen jeweils Schreibmasken-Bitpositionen entsprechen, die angeben, dass das entsprechende Datenelement der Quelle zu speichern ist.
Abstract:
A vector friendly instruction format and execution thereof. According to one embodiment of the invention, a processor is configured to execute an instruction set. The instruction set includes a vector friendly instruction format. The vector friendly instruction format has a plurality of fields including a base operation field, a modifier field, an augmentation operation field, and a data element width field, wherein the first instruction format supports different versions of base operations and different augmentation operations through placement of different values in the base operation field, the modifier field, the alpha field, the beta field, and the data element width field, and wherein only one of the different values may be placed in each of the base operation field, the modifier field, the alpha field, the beta field, and the data element width field on each occurrence of an instruction in the first instruction format in instruction streams.
Abstract:
sistemas, aparelhos e métodos para mistura de dois operandos de fonte em destinação única usando writemask a presente invenção refere-se a concretizações de sistemas, aparelhos, e métodos para realização de uma instrução agrupada em um processador de computação. em algumas concretizações, a execução de uma instrução agrupada causa uma seleção de dados elementos por elemento dos elementos de dados dos primeiros e segundo operandos de fonte usando as posições de btye correspondentes de um writemask como um seletor entre os primeiros e segundio operandos, e armazenagem dos elementos de dados selecionados na destinação na posição correspondente na destinação.
Abstract:
Ausführungsformen von Systemen, Vorrichtungen und Verfahren zum Durchführen eines Vermischungsbefehls in einem Computerprozessor werden beschrieben. In einigen Ausführungsformen veranlaßt die Ausführung eines Vermischungsbefehls eine datenelementweise Selektion von Datenelementen erster und zweiter Quelloperanden unter Verwendung der entsprechenden Bitpositionen einer Schreibmaske als ein Selektor zwischen den ersten und zweiten Operanden und Speichern der selektierten Datenelemente in dem Ziel an der entsprechenden Position in dem Ziel.