Invention Grant
- Patent Title: Sparse and data-parallel inference method and system for the latent Dirichlet allocation model
-
Application No.: US14755312Application Date: 2015-06-30
-
Publication No.: US09767416B2Publication Date: 2017-09-19
- Inventor: Jean-Baptiste Tristan , Guy L. Steele, Jr. , Joseph Tassarotti
- Applicant: Oracle International Corporation
- Applicant Address: US CA Redwood Shores
- Assignee: Oracle International Corporation
- Current Assignee: Oracle International Corporation
- Current Assignee Address: US CA Redwood Shores
- Agency: Hickman Palermo Becker Bingham LLP
- Main IPC: G06F17/17
- IPC: G06F17/17 ; G06F17/20 ; G06N7/00 ; G06F9/50 ; G06F17/27 ; G06F17/30

Abstract:
Herein is described a data-parallel and sparse algorithm for topic modeling. This algorithm is based on a highly parallel algorithm for a Greedy Gibbs sampler. The Greedy Gibbs sampler is a Markov-Chain Monte Carlo algorithm that estimates topics, in an unsupervised fashion, by estimating the parameters of the topic model Latent Dirichlet Allocation (LDA). The Greedy Gibbs sampler is a data-parallel algorithm for topic modeling, and is configured to be implemented on a highly-parallel architecture, such as a GPU. The Greedy Gibbs sampler is modified to take advantage of data sparsity while maintaining the parallelism. Furthermore, in an embodiment, implementation of the Greedy Gibbs sampler uses both densely-represented and sparsely-represented matrices to reduce the amount of computation while maintaining fast accesses to memory for implementation on a GPU.
Public/Granted literature
- US20160224544A1 SPARSE AND DATA-PARALLEL INFERENCE METHOD AND SYSTEM FOR THE LATENT DIRICHLET ALLOCATION MODEL Public/Granted day:2016-08-04
Information query