Invention Grant
US08489529B2 Deep convex network with joint use of nonlinear random projection, Restricted Boltzmann Machine and batch-based parallelizable optimization
有权
联合使用非线性随机投影的深凸网络,限制玻尔兹曼机器和基于批量的可并行化优化
- Patent Title: Deep convex network with joint use of nonlinear random projection, Restricted Boltzmann Machine and batch-based parallelizable optimization
- Patent Title (中): 联合使用非线性随机投影的深凸网络,限制玻尔兹曼机器和基于批量的可并行化优化
-
Application No.: US13077978Application Date: 2011-03-31
-
Publication No.: US08489529B2Publication Date: 2013-07-16
- Inventor: Li Deng , Dong Yu , Alejandro Acero
- Applicant: Li Deng , Dong Yu , Alejandro Acero
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Main IPC: G06N5/00
- IPC: G06N5/00

Abstract:
A method is disclosed herein that includes an act of causing a processor to access a deep-structured, layered or hierarchical model, called deep convex network, retained in a computer-readable medium, wherein the deep-structured model comprises a plurality of layers with weights assigned thereto. This layered model can produce the output serving as the scores to combine with transition probabilities between states in a hidden Markov model and language model scores to form a full speech recognizer. The method makes joint use of nonlinear random projections and RBM weights, and it stacks a lower module's output with the raw data to establish its immediately higher module. Batch-based, convex optimization is performed to learn a portion of the deep convex network's weights, rendering it appropriate for parallel computation to accomplish the training. The method can further include the act of jointly substantially optimizing the weights, the transition probabilities, and the language model scores of the deep-structured model using the optimization criterion based on a sequence rather than a set of unrelated frames.
Public/Granted literature
Information query
IPC分类:
G | 物理 |
G06 | 计算;推算或计数 |
G06N | 基于特定计算模型的计算机系统 |
G06N5/00 | 利用基于知识的模式的计算机系统 |