Method and apparatus with neural network layer contraction
Abstract:
A processor-implemented neural network method includes: determining a reference sample among sequential input samples to be processed by a neural network, the neural network comprising an input layer, one or more hidden layers, and an output layer; performing an inference process of obtaining an output activation of the output layer based on operations in the hidden layers corresponding to the reference sample input to the input layer; determining layer contraction parameters for determining an affine transformation relationship between the input layer and the output layer, for approximation of the inference process; and performing inference on one or more other sequential input samples among the sequential input samples using affine transformation based on the layer contraction parameters determined with respect to the reference sample.
Public/Granted literature
Information query
Patent Agency Ranking
0/0