Data-optimized neural network traversal
Abstract:
Executing a neural network includes generating an output tile of a first layer of the neural network by processing an input tile to the first layer and storing the output tile of the first layer in an internal memory of a processor. An output tile of a second layer of the neural network can be generated using the processor by processing the output tile of the first layer stored in the internal memory.
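The tile-by-tile traversal described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the first layer is a 1D "valid" convolution, the second layer is a pointwise ReLU, the names `conv1d_valid`, `relu`, `run_layer_by_layer`, and `run_tiled` are hypothetical, and an ordinary Python list named `internal_buffer` stands in for the processor's internal memory.

```python
def conv1d_valid(x, kernel):
    """Hypothetical first layer: 1D convolution with no padding."""
    k = len(kernel)
    return [sum(x[i + j] * kernel[j] for j in range(k))
            for i in range(len(x) - k + 1)]


def relu(x):
    """Hypothetical second layer: pointwise ReLU."""
    return [max(0.0, v) for v in x]


def run_layer_by_layer(x, kernel):
    """Baseline: the full first-layer output is materialized before layer 2."""
    return relu(conv1d_valid(x, kernel))


def run_tiled(x, kernel, tile_size):
    """Tiled traversal: each first-layer output tile is consumed by the
    second layer right away, so only one tile of intermediate data exists
    at a time (assumed here to fit in internal memory)."""
    k = len(kernel)
    out_len = len(x) - k + 1
    output = []
    for start in range(0, out_len, tile_size):
        end = min(start + tile_size, out_len)
        # Input tile for layer 1, including the k - 1 extra samples the
        # convolution needs to produce outputs [start, end).
        input_tile = x[start:end + k - 1]
        # Output tile of the first layer, held in the stand-in buffer.
        internal_buffer = conv1d_valid(input_tile, kernel)
        # Output tile of the second layer, computed from the buffered tile.
        output.extend(relu(internal_buffer))
    return output


x = [1.0, -2.0, 3.0, -4.0, 5.0, -6.0, 7.0, -8.0]
kernel = [1.0, 0.0, -1.0]
assert run_tiled(x, kernel, tile_size=4) == run_layer_by_layer(x, kernel)
```

In this sketch the tiled and layer-by-layer traversals produce the same result; the difference is that the tiled version never holds the whole first-layer output at once, which is the property that would let the intermediate data stay in a small on-chip memory.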