Efficient Sparse Banded Acoustic Models for Speech Recognition

Author

Weibin Zhang ; Fung, Pascale

Author_Institution

Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China

Volume

21

Issue

3

fYear

2014

fDate

Mar-14

Firstpage

280

Lastpage

283

Abstract

We propose sparse banded acoustic models to significantly improve the recognition accuracy and reduce the computational cost of speech recognition systems. The sparse banded models are trained using a weighted lasso regularization. In addition, we propose new feature orders to reduce the bandwidth of sparse banded models in order to speed up computation. Experimental results on the Wall Street Journal data set show that sparse banded models significantly outperform diagonal and full covariance models by 9.5% and 15.1% relatively. Sparse banded models also run the fastest. The advantages of sparse banded models are also demonstrated on the collected Cantonese data set.

Keywords

covariance matrices; sparse matrices; speech recognition; Cantonese data set; Wall Street Journal data set; full covariance model; sparse banded acoustic model; speech recognition accuracy; weighted lasso regularization; Acoustics; Computational modeling; Covariance matrices; Data models; Feature extraction; Hidden Markov models; Sparse matrices; Inverse covariance matrix; sparse banded models; speech recognition;

fLanguage

English

Journal_Title

Signal Processing Letters, IEEE

Publisher

ieee

ISSN

1070-9908

Type

jour

DOI

10.1109/LSP.2013.2292920

Filename

6675805