• DocumentCode
    640555
  • Title

    Full covariance Gaussian mixture models evaluation on GPU

  • Author

    Vanek, Jan ; Trmal, Jan ; Psutka, Josef V. ; Psutka, Josef

  • Author_Institution
    Dept. of Cybern., Univ. of West Bohemia, Plzen, Czech Republic
  • fYear
    2012
  • fDate
    12-15 Dec. 2012
  • Abstract
    Gaussian mixture models (GMMs) are often used in various data processing and classification tasks to model a continuous probability density in a multi-dimensional space. In cases, where the dimension of the feature space is relatively high (e.g. in the automatic speech recognition (ASR)), GMM with a higher number of Gaussians with diagonal covariances (DC) instead of full covariances (FC) is used from the two reasons. The first reason is a problem how to estimate robust FC matrices with a limited training data set. The second reason is a much higher computational cost during the GMM evaluation. The first reason was addressed in many recent publications. In contrast, this paper describes an efficient implementation on Graphic Processing Unit (GPU) of the FC-GMM evaluation, which addresses the second reason. The performance was tested on acoustic models for ASR, and it is shown that even a low-end laptop GPU is capable to evaluate a large acoustic model in a fraction of the real speech time. Three variants of the algorithm were implemented and compared on various GPUs: NVIDIA CUDA, NVIDIA OpenCL, and ATI/AMD OpenCL.
  • Keywords
    Gaussian processes; covariance matrices; graphics processing units; ASR; ATI-AMD OpenCL; FC-GMM evaluation; NVIDIA CUDA; NVIDIA OpenCL; automatic speech recognition; continuous probability density function; data classification task; data processing task; diagonal covariances; full covariance Gaussian mixture models; full covariances; graphics processing unit; low-end laptop GPU; Computational modeling; Covariance matrices; Delays; Europe; Graphics processing units; Instruction sets; Automatic Speech Recognition; CUDA; Full Covariance; GPU; Gaussian Mixture Models; OpenCL;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Information Technology (ISSPIT), 2012 IEEE International Symposium on
  • Conference_Location
    Ho Chi Minh City
  • Print_ISBN
    978-1-4673-5604-6
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2012.6621287
  • Filename
    6621287