Systems and methods for trace norm regularization and faster inference for embedded models
Abstract:
Described herein are systems and methods for compressing and speeding up dense matrix multiplications as found, for example, in the fully connected and recurrent layers of neural networks for embedded large vocabulary continuous speech recognition (LVCSR). For compression, embodiments of a trace norm regularization technique are introduced and studied for training low-rank factored versions of matrix multiplications. Compared to standard low-rank training, the methods more consistently lead to good accuracy versus number-of-parameters trade-offs and can be used to speed up training of large models. Faster inference may be further enabled on ARM processors through kernels optimized for small batch sizes, resulting in speedups over the currently used library. Beyond LVCSR, the techniques are also generally applicable to embedded neural networks with large fully connected or recurrent layers.
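The factored, trace-norm-regularized training described above can be illustrated with a minimal sketch, not taken from the patent text. It relies on the standard variational bound that the trace (nuclear) norm of a product W = U V is at most one half the sum of the squared Frobenius norms of the factors; the class name LowRankLinear, the rank argument, and the penalty weight are illustrative assumptions.

    # Minimal sketch (assumptions noted above), in PyTorch.
    import torch
    import torch.nn as nn

    class LowRankLinear(nn.Module):
        """Factored replacement for a dense layer: W is parameterized as U @ V of rank r (no bias)."""
        def __init__(self, in_features, out_features, rank):
            super().__init__()
            self.U = nn.Parameter(0.01 * torch.randn(out_features, rank))
            self.V = nn.Parameter(0.01 * torch.randn(rank, in_features))

        def forward(self, x):
            # (batch, in) @ (in, rank) @ (rank, out) -> (batch, out)
            return x @ self.V.t() @ self.U.t()

        def trace_norm_penalty(self):
            # Variational bound: ||U V||_* <= 0.5 * (||U||_F^2 + ||V||_F^2),
            # so penalizing the factors' Frobenius norms regularizes the trace norm of W.
            return 0.5 * (self.U.pow(2).sum() + self.V.pow(2).sum())

    # Illustrative use: add the penalty to the task loss with an assumed weight.
    layer = LowRankLinear(in_features=512, out_features=512, rank=64)
    x = torch.randn(8, 512)
    loss = layer(x).pow(2).mean() + 1e-4 * layer.trace_norm_penalty()
    loss.backward()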