• DocumentCode
    1262
  • Title

    Efficient Sparse Banded Acoustic Models for Speech Recognition

  • Author

    Weibin Zhang ; Fung, Pascale

  • Author_Institution
    Dept. of Electron. & Comput. Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
  • Volume
    21
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    280
  • Lastpage
    283
  • Abstract
    We propose sparse banded acoustic models to significantly improve the recognition accuracy and reduce the computational cost of speech recognition systems. The sparse banded models are trained using a weighted lasso regularization. In addition, we propose new feature orders to reduce the bandwidth of sparse banded models in order to speed up computation. Experimental results on the Wall Street Journal data set show that sparse banded models significantly outperform diagonal and full covariance models by 9.5% and 15.1% relatively. Sparse banded models also run the fastest. The advantages of sparse banded models are also demonstrated on the collected Cantonese data set.
  • Keywords
    covariance matrices; sparse matrices; speech recognition; Cantonese data set; Wall Street Journal data set; full covariance model; sparse banded acoustic model; speech recognition accuracy; weighted lasso regularization; Acoustics; Computational modeling; Covariance matrices; Data models; Feature extraction; Hidden Markov models; Sparse matrices; Inverse covariance matrix; sparse banded models; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2013.2292920
  • Filename
    6675805