• DocumentCode
    2424337
  • Title

    Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems

  • Author

    Ou, Jianlin ; Cai, Jun ; Lin, Qian

  • Author_Institution
    Dept. of Cognitive Sci., Xiamen Univ., Xiamen
  • fYear
    2008
  • fDate
    7-9 July 2008
  • Firstpage
    123
  • Lastpage
    127
  • Abstract
    Most state-of-the-art LVCSR systems are based on continuous density HMMs, which are typically implemented using Gaussian mixture distributions. Such statistical modeling systems usually operate slower than real-time, largely because of the heavy computational overhead of the likelihood computation. The objective of our research is to investigate application of modern SIMD technology to speed up the likelihood computation without degrading the recognition accuracy. In this paper, the likelihood computation of continuous density HMMs is analyzed to show that the conventional way of sequential computing is time-consuming and the likelihood computation itself can be implemented in parallel. A SIMD-based algorithm which can carry out parallel likelihood computation is presented in this paper. Likelihood computation modules in HTK3.4 toolkit have been modified with SIMD instructions to implement this algorithm. Experiments on TIMIT and WSJ0 corpora show that the SIMD-based data-level parallelism can significantly reduce the time overhead for likelihood computation.
  • Keywords
    Gaussian distribution; hidden Markov models; maximum likelihood estimation; parallel processing; speech recognition; vocabulary; Gaussian mixture distributions; HTK3.4 toolkit; SIMD technology; TIMIT; WSJ0 corpora; continuous density hidden Markov model; data-level parallelism; large vocabulary continuous speech recognition system; parallel likelihood computation; sequential computing; statistical modeling systems; Computer aided instruction; Computer architecture; Concurrent computing; Degradation; Hidden Markov models; Parallel processing; Real time systems; Registers; Speech recognition; Streaming media;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-1723-0
  • Electronic_ISBN
    978-1-4244-1724-7
  • Type

    conf

  • DOI
    10.1109/ICALIP.2008.4590086
  • Filename
    4590086