Invention Grant
- Patent Title: Accelerated quantized multiply-and-add operations
-
Application No.: US15934681Application Date: 2018-03-23
-
Publication No.: US10678508B2Publication Date: 2020-06-09
- Inventor: Dana Michelle Vantrease , Randy Huang , Ron Diamant , Thomas Elmer , Sundeep Amirineni
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Kilpatrick Townsend & Stockton LLP
- Main IPC: G06F7/544
- IPC: G06F7/544 ; G06N3/08 ; G06F17/15 ; G06N3/063 ; G06N3/04

Abstract:
Disclosed herein are techniques for accelerating convolution operations or other matrix multiplications in applications such as neural network. A computer-implemented method includes receiving low-precision inputs for a convolution operation from a storage device, and subtracting a low-precision value representing a high-precision zero value from the low-precision inputs to generate difference values, where the low-precision inputs are asymmetrically quantized from high-precision inputs. The method also includes performing multiplication and summation operations on the difference values to generate a sum of products, and generating a high-precision output by scaling the sum of products with a scaling factor.
Public/Granted literature
- US20190294413A1 ACCELERATED QUANTIZED MULTIPLY-AND-ADD OPERATIONS Public/Granted day:2019-09-26
Information query