Apparatus for hardware accelerated machine learning

Invention Grant

US10817802B2 Apparatus for hardware accelerated machine learning 有权

Please log in to see more content

Patent Title: Apparatus for hardware accelerated machine learning
Application No.: US15588558

Application Date: 2017-05-05
Publication No.: US10817802B2

Publication Date: 2020-10-27
Inventor: Jeremy Bruestle , Choong Ng
Applicant: Intel Corporation
Applicant Address: US CA Santa Clara
Assignee: Intel Corporation
Current Assignee: Intel Corporation
Current Assignee Address: US CA Santa Clara
Agency: Trop, Pruner & Hu, P.C.
Main IPC: G06F12/00
IPC: G06F12/00 ; G06N20/00 ; G06F9/46 ; G06F7/48 ; G06F5/01 ; G06F7/58

Abstract:

An architecture and associated techniques of an apparatus for hardware accelerated machine learning are disclosed. The architecture features multiple memory banks storing tensor data. The tensor data may be concurrently fetched by a number of execution units working in parallel. Each operational unit supports an instruction set specific to certain primitive operations for machine learning. An instruction decoder is employed to decode a machine learning instruction and reveal one or more of the primitive operations to be performed by the execution units, as well as the memory addresses of the operands of the primitive operations as stored in the memory banks. The primitive operations, upon performed or executed by the execution units, may generate some output that can be saved into the memory banks. The fetching of the operands and the saving of the output may involve permutation and duplication of the data elements involved.

Public/Granted literature

US20170323224A1 APPARATUS FOR HARDWARE ACCELERATED MACHINE LEARNING Public/Granted day:2017-11-09

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F12/00	安装在筛选装置之上的在存储器系统或体系结构内的存取、寻址或分配（来自记录载体的数字输入，或者到记录载体上去的数字输出，例如到磁盘存储单元，G06F3/06）