Title :
A Method of Accelerating LDA Program with GPU
Author :
Jiang, Yanjun ; Wen, Hualong ; Gao, Zhanchun
Abstract :
LDA (Latent Dirichlet Allocation) is a text modeling algorithm based on a generative probabilistic model. It is widely used to discover latent topics among a set of documents. Mahout has implemented LDA algorithm, however, the execution time of the LDA program is very long when processing a large amount of documents, because the documents are processed in sequence. This paper introduces a method to modify this program with CUDA toolkit provided by NVIDIA, in order that a group of documents could be processed in parallel on GPU. Using this method, the LDA program could be accelerated greatly.
Keywords :
graphics processing units; parallel architectures; probability; resource allocation; text analysis; CUDA toolkit; GPU; LDA algorithm; LDA program acceleration method; NVIDIA; execution time; generative probabilistic model; latent Dirichlet allocation; latent topic discovery; parallel processing; text modeling algorithm; Acceleration; Estimation; Graphics processing units; Instruction sets; Kernel; Parallel processing; Vectors; CUDA; GPU; LDA; Mahout;
Conference_Titel :
Networking and Distributed Computing (ICNDC), 2012 Third International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-2858-6
DOI :
10.1109/ICNDC.2012.14