DocumentCode
1262
Title
Efficient Sparse Banded Acoustic Models for Speech Recognition
Author
Weibin Zhang ; Fung, Pascale
Author_Institution
Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Volume
21
Issue
3
fYear
2014
fDate
Mar-14
Firstpage
280
Lastpage
283
Abstract
We propose sparse banded acoustic models to significantly improve the recognition accuracy and reduce the computational cost of speech recognition systems. The sparse banded models are trained using a weighted lasso regularization. In addition, we propose new feature orders to reduce the bandwidth of sparse banded models in order to speed up computation. Experimental results on the Wall Street Journal data set show that sparse banded models significantly outperform diagonal and full covariance models by 9.5% and 15.1% relatively. Sparse banded models also run the fastest. The advantages of sparse banded models are also demonstrated on the collected Cantonese data set.
Keywords
covariance matrices; sparse matrices; speech recognition; Cantonese data set; Wall Street Journal data set; full covariance model; sparse banded acoustic model; speech recognition accuracy; weighted lasso regularization; Acoustics; Computational modeling; Covariance matrices; Data models; Feature extraction; Hidden Markov models; Sparse matrices; Inverse covariance matrix; sparse banded models; speech recognition;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2013.2292920
Filename
6675805
Link To Document