• DocumentCode
    2605306
  • Title

    A Method of Accelerating LDA Program with GPU

  • Author

    Jiang, Yanjun ; Wen, Hualong ; Gao, Zhanchun

  • fYear
    2012
  • fDate
    21-24 Oct. 2012
  • Firstpage
    26
  • Lastpage
    29
  • Abstract
    LDA (Latent Dirichlet Allocation) is a text modeling algorithm based on a generative probabilistic model. It is widely used to discover latent topics among a set of documents. Mahout has implemented LDA algorithm, however, the execution time of the LDA program is very long when processing a large amount of documents, because the documents are processed in sequence. This paper introduces a method to modify this program with CUDA toolkit provided by NVIDIA, in order that a group of documents could be processed in parallel on GPU. Using this method, the LDA program could be accelerated greatly.
  • Keywords
    graphics processing units; parallel architectures; probability; resource allocation; text analysis; CUDA toolkit; GPU; LDA algorithm; LDA program acceleration method; NVIDIA; execution time; generative probabilistic model; latent Dirichlet allocation; latent topic discovery; parallel processing; text modeling algorithm; Acceleration; Estimation; Graphics processing units; Instruction sets; Kernel; Parallel processing; Vectors; CUDA; GPU; LDA; Mahout;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Networking and Distributed Computing (ICNDC), 2012 Third International Conference on
  • Conference_Location
    Hangzhou
  • ISSN
    2165-5006
  • Print_ISBN
    978-1-4673-2858-6
  • Type

    conf

  • DOI
    10.1109/ICNDC.2012.14
  • Filename
    6386645