• DocumentCode
    1984262
  • Title

    Noise and speaker robustness in a Persian continuous speech recognition system

  • Author

    Veisi, Hadi ; Sameti, Hossein

  • Author_Institution
    Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran
  • fYear
    2007
  • fDate
    12-15 Feb. 2007
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper VTLN speaker normalization, MLLR and MAP adaptation methods are investigated in a Persian HMM-based speaker independent large vocabulary continuous speech recognition system. Speaker and environmental noise robustness are achieved in real world applications for this system. A search-based method is used in VTLN to find speaker relative warping factors. The warping factors are applied to signalpsilas spectrum to normalize the variation effect of VTL between speakers. In the MLLR framework, Gaussian mean and covariance transformations in global and full adaptation are experienced. In this method, regression tree based adaptation in batch-supervised fashion is used. Also the standard MAP is experienced as an adaptation method. Combinations of these approaches with CMN robust feature method are evaluated on 4 different tasks. Significant improvement is achieved in the recognition performance in noisy environments such that it makes the system operational in real applications.
  • Keywords
    natural languages; regression analysis; speech recognition; trees (mathematics); Gaussian mean; Persian continuous speech recognition system; covariance transformations; large vocabulary continuous speech recognition system; regression tree based adaptation; search-based method; speaker normalization; speaker robustness; Application software; Cepstral analysis; Frequency; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Regression tree analysis; Speech recognition; Vocabulary; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Its Applications, 2007. ISSPA 2007. 9th International Symposium on
  • Conference_Location
    Sharjah
  • Print_ISBN
    978-1-4244-0778-1
  • Electronic_ISBN
    978-1-4244-1779-8
  • Type

    conf

  • DOI
    10.1109/ISSPA.2007.4555292
  • Filename
    4555292