Efficient optimization for neural network deployment and execution

Invention Grant

US12079608B2 Efficient optimization for neural network deployment and execution 有权

Please log in to see more content

Patent Title: Efficient optimization for neural network deployment and execution
Application No.: US17513679

Application Date: 2021-10-28
Publication No.: US12079608B2

Publication Date: 2024-09-03
Inventor: Ashutosh Pandey , Kaiping Li , Vikram Kumar Ramanna
Applicant: Cypress Semiconductor Corporation
Applicant Address: US CA San Jose
Assignee: Cypress Semiconductor Corporation
Current Assignee: Cypress Semiconductor Corporation
Current Assignee Address: US CA San Jose
Main IPC: G06F9/50
IPC: G06F9/50 ; G06F8/41

Efficient optimization for neural network deployment and execution

Abstract:

Implementations disclosed describe methods and systems to perform the methods of deploying and executing machine learning models on target-specific computational platforms. Optimization techniques include but are not limited to alignment of kernel operations with hardware instructions of a target processing device, reduction of kernel dimensions near boundaries of data, efficient reuse of a small number of memory components during neural network operations, run-time quantization of data and neural network parameters, and other methods.

Public/Granted literature

US20220303176A1 EFFICIENT OPTIMIZATION FOR NEURAL NETWORK DEPLOYMENT AND EXECUTION Public/Granted day:2022-09-22

Information query

Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F9/00	程序控制装置，例如，控制单元（用于外部设备的程序控制入G06F13/10）
G06F9/06	.应用存入的程序的，即应用处理设备的内部存储来接收程序并保持程序的
G06F9/46	..多道程序装置
G06F9/50	...资源分配，例如，中央处理单元[CPU]的