High perforamance machine learning inference framework for edge devices

Invention Grant

US11301762B1 High perforamance machine learning inference framework for edge devices 有权

Please log in to see more content

Patent Title: High perforamance machine learning inference framework for edge devices
Application No.: US16179217

Application Date: 2018-11-02
Publication No.: US11301762B1

Publication Date: 2022-04-12
Inventor: Gang Chen , Long Gao , Eduardo Manuel Calleja
Applicant: Amazon Technologies, Inc.
Applicant Address: US WA Seattle
Assignee: Amazon Technologies, Inc.
Current Assignee: Amazon Technologies, Inc.
Current Assignee Address: US WA Seattle
Agency: Nicholson De Vos Webster & Elliott LLP
Main IPC: G06N5/02
IPC: G06N5/02 ; G06N20/00 ; G06F16/11

High perforamance machine learning inference framework for edge devices

Abstract:

Techniques for high-performance machine learning (ML) inference in heterogenous edge devices are described. A ML model trained using a variety of different frameworks is translated into a common format that is runnable by inferences engines of edge devices. The translated model is optimized in hardware-agnostic and/or hardware-specific ways to improve inference performance, and the optimized model is sent to the edge devices. The inference engine for any edge device can be accessed by a customer application using a same defined API, regardless of the hardware characteristics of the edge device or the original format of the ML model.

Public/Granted literature

US1266764A Projectile-rotator. Public/Granted day:1918-05-21

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06N	基于特定计算模型的计算机系统
G06N5/00	利用基于知识的模式的计算机系统
G06N5/02	.知识表达