Invention Grant
- Patent Title: High perforamance machine learning inference framework for edge devices
-
Application No.: US16179217Application Date: 2018-11-02
-
Publication No.: US11301762B1Publication Date: 2022-04-12
- Inventor: Gang Chen , Long Gao , Eduardo Manuel Calleja
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main IPC: G06N5/02
- IPC: G06N5/02 ; G06N20/00 ; G06F16/11

Abstract:
Techniques for high-performance machine learning (ML) inference in heterogenous edge devices are described. A ML model trained using a variety of different frameworks is translated into a common format that is runnable by inferences engines of edge devices. The translated model is optimized in hardware-agnostic and/or hardware-specific ways to improve inference performance, and the optimized model is sent to the edge devices. The inference engine for any edge device can be accessed by a customer application using a same defined API, regardless of the hardware characteristics of the edge device or the original format of the ML model.
Public/Granted literature
- US1266764A Projectile-rotator. Public/Granted day:1918-05-21
Information query