• DocumentCode
    125567
  • Title

    A Parallel EM Algorithm for Gaussian Mixture Models Implemented on a NUMA System Using OpenMP

  • Author

    Kwedlo, Wojciech

  • Author_Institution
    Fac. of Comput. Sci., Bialystok Univ. of Technol., Bialystok, Poland
  • fYear
    2014
  • fDate
    12-14 Feb. 2014
  • Firstpage
    292
  • Lastpage
    298
  • Abstract
    In the paper the problem of estimation of Gaussian mixture model parameters is considered. A shared memory parallelization of the standard EM algorithm, based on data decomposition, is proposed. Our approach uses a rowwise block striped decomposition of large arrays storing feature vectors and posterior probabilities. Additionally, some NUMA optimizations, which allow threads to use as much local memory as possible, exploiting the first-touch memory allocation policy of the Linux operating system, are described. The proposed method was implemented in OpenMP and tested on a 64-core system based on four AMD Opteron 6272 (codenamed "Interlagos\´\´) processors. The experimental results indicate, that on large datasets, the algorithm scales very well with respect to the number of cores, and NUMA optimizations significantly improve its performance.
  • Keywords
    Gaussian processes; Linux; application program interfaces; expectation-maximisation algorithm; mixture models; multi-threading; multiprocessing systems; parallel algorithms; parameter estimation; storage management; 64-core system; AMD Opteron 6272; Gaussian mixture model; Interlagos processors; Linux operating system; NUMA optimizations; NUMA system; OpenMP; data decomposition; feature vectors; first-touch memory allocation policy; local memory; parallel EM algorithm; parameter estimation; posterior probabilities; rowwise block striped decomposition; shared memory parallelization; standard EM algorithm; threads; Arrays; Covariance matrices; Instruction sets; Linux; Optimization; Vectors; EM algorithm; Gaussian mixture model; NUMA; OpenMP;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-Based Processing (PDP), 2014 22nd Euromicro International Conference on
  • Conference_Location
    Torino
  • ISSN
    1066-6192
  • Type

    conf

  • DOI
    10.1109/PDP.2014.77
  • Filename
    6787289