Low resource computational block for a trained neural network
Abstract:
A computational block configured to perform an inference task by applying a plurality of low resource computing operations to a binary input feature tensor to generate an integer feature tensor that is equivalent to an output of multiplication and accumulation operations performed in respect of a ternary weight tensor and the binary input feature tensor; and performing a comparison operation between the generated integer feature tensor and a comparison threshold to generate a binary output feature tensor. The plurality of low resource computing operations are applied to the binary input feature tensor using first and second weight tensors that each include n binary elements and that collectively represent a respective n elements of the ternary weight tensor
Public/Granted literature
Information query
Patent Agency Ranking
0/0