DocumentCode
1984262
Title
Noise and speaker robustness in a Persian continuous speech recognition system
Author
Veisi, Hadi ; Sameti, Hossein
Author_Institution
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran
fYear
2007
fDate
12-15 Feb. 2007
Firstpage
1
Lastpage
4
Abstract
In this paper VTLN speaker normalization, MLLR and MAP adaptation methods are investigated in a Persian HMM-based speaker independent large vocabulary continuous speech recognition system. Speaker and environmental noise robustness are achieved in real world applications for this system. A search-based method is used in VTLN to find speaker relative warping factors. The warping factors are applied to signalpsilas spectrum to normalize the variation effect of VTL between speakers. In the MLLR framework, Gaussian mean and covariance transformations in global and full adaptation are experienced. In this method, regression tree based adaptation in batch-supervised fashion is used. Also the standard MAP is experienced as an adaptation method. Combinations of these approaches with CMN robust feature method are evaluated on 4 different tasks. Significant improvement is achieved in the recognition performance in noisy environments such that it makes the system operational in real applications.
Keywords
natural languages; regression analysis; speech recognition; trees (mathematics); Gaussian mean; Persian continuous speech recognition system; covariance transformations; large vocabulary continuous speech recognition system; regression tree based adaptation; search-based method; speaker normalization; speaker robustness; Application software; Cepstral analysis; Frequency; Loudspeakers; Maximum likelihood linear regression; Noise robustness; Regression tree analysis; Speech recognition; Vocabulary; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Its Applications, 2007. ISSPA 2007. 9th International Symposium on
Conference_Location
Sharjah
Print_ISBN
978-1-4244-0778-1
Electronic_ISBN
978-1-4244-1779-8
Type
conf
DOI
10.1109/ISSPA.2007.4555292
Filename
4555292
Link To Document