-
公开(公告)号:US20210174214A1
公开(公告)日:2021-06-10
申请号:US17108643
申请日:2020-12-01
Applicant: The MathWorks, Inc.
Inventor: Vaidehi Venkatesan , Jayaprabha Shankar , Shixin Zhuang , Girish Venkataramani , FNU Hanumantharayappa
Abstract: Systems and methods quantize an application having a trained Deep Neural Network (DNN) for deployment on target hardware. The application may be instrumented to observe data values generated during execution of the application. Statistics may be generated for the observed data values and presented in a visualization tool. The application may be quantized through a rules based approach. The quantization may be based on the statistics and on constraints imposed by resources available at the target hardware. The systems and methods may present the proposed data types resulting from the quantization and may create a quantized version of the application incorporating the proposed data types. The systems and methods may generate performance data to validate the quantized version of the application. Changes to the rules may be made and the quantization process repeated if the performance is not satisfactory.