DocumentCode
2424337
Title
Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems
Author
Ou, Jianlin ; Cai, Jun ; Lin, Qian
Author_Institution
Dept. of Cognitive Sci., Xiamen Univ., Xiamen
fYear
2008
fDate
7-9 July 2008
Firstpage
123
Lastpage
127
Abstract
Most state-of-the-art LVCSR systems are based on continuous density HMMs, which are typically implemented using Gaussian mixture distributions. Such statistical modeling systems usually operate slower than real-time, largely because of the heavy computational overhead of the likelihood computation. The objective of our research is to investigate application of modern SIMD technology to speed up the likelihood computation without degrading the recognition accuracy. In this paper, the likelihood computation of continuous density HMMs is analyzed to show that the conventional way of sequential computing is time-consuming and the likelihood computation itself can be implemented in parallel. A SIMD-based algorithm which can carry out parallel likelihood computation is presented in this paper. Likelihood computation modules in HTK3.4 toolkit have been modified with SIMD instructions to implement this algorithm. Experiments on TIMIT and WSJ0 corpora show that the SIMD-based data-level parallelism can significantly reduce the time overhead for likelihood computation.
Keywords
Gaussian distribution; hidden Markov models; maximum likelihood estimation; parallel processing; speech recognition; vocabulary; Gaussian mixture distributions; HTK3.4 toolkit; SIMD technology; TIMIT; WSJ0 corpora; continuous density hidden Markov model; data-level parallelism; large vocabulary continuous speech recognition system; parallel likelihood computation; sequential computing; statistical modeling systems; Computer aided instruction; Computer architecture; Concurrent computing; Degradation; Hidden Markov models; Parallel processing; Real time systems; Registers; Speech recognition; Streaming media;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4244-1723-0
Electronic_ISBN
978-1-4244-1724-7
Type
conf
DOI
10.1109/ICALIP.2008.4590086
Filename
4590086
Link To Document