Using SIMD technology to speed up likelihood computation in HMM-based speech recognition systems

Author

Ou, Jianlin ; Cai, Jun ; Lin, Qian

Author_Institution

Dept. of Cognitive Sci., Xiamen Univ., Xiamen

fYear

2008

fDate

7-9 July 2008

Firstpage

123

Lastpage

127

Abstract

Most state-of-the-art LVCSR systems are based on continuous density HMMs, which are typically implemented using Gaussian mixture distributions. Such statistical modeling systems usually operate slower than real-time, largely because of the heavy computational overhead of the likelihood computation. The objective of our research is to investigate application of modern SIMD technology to speed up the likelihood computation without degrading the recognition accuracy. In this paper, the likelihood computation of continuous density HMMs is analyzed to show that the conventional way of sequential computing is time-consuming and the likelihood computation itself can be implemented in parallel. A SIMD-based algorithm which can carry out parallel likelihood computation is presented in this paper. Likelihood computation modules in HTK3.4 toolkit have been modified with SIMD instructions to implement this algorithm. Experiments on TIMIT and WSJ0 corpora show that the SIMD-based data-level parallelism can significantly reduce the time overhead for likelihood computation.

Keywords

Gaussian distribution; hidden Markov models; maximum likelihood estimation; parallel processing; speech recognition; vocabulary; Gaussian mixture distributions; HTK3.4 toolkit; SIMD technology; TIMIT; WSJ0 corpora; continuous density hidden Markov model; data-level parallelism; large vocabulary continuous speech recognition system; parallel likelihood computation; sequential computing; statistical modeling systems; Computer aided instruction; Computer architecture; Concurrent computing; Degradation; Hidden Markov models; Parallel processing; Real time systems; Registers; Speech recognition; Streaming media;

fLanguage

English

Publisher

ieee

Conference_Titel

Audio, Language and Image Processing, 2008. ICALIP 2008. International Conference on

Conference_Location

Shanghai

Print_ISBN

978-1-4244-1723-0

Electronic_ISBN

978-1-4244-1724-7

Type

conf

DOI

10.1109/ICALIP.2008.4590086

Filename

4590086