DocumentCode :
1059680
Title :
Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition
Author :
Lin, Shih-Hsiang ; Chen, Berlin ; Yeh, Yao-Ming
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Normal Univ., Taipei
Volume :
17
Issue :
1
fYear :
2009
Firstpage :
84
Lastpage :
94
Abstract :
The performance of current automatic speech recognition (ASR) systems often deteriorates radically when the input speech is corrupted by various kinds of noise sources. Several methods have been proposed to improve ASR robustness over the last few decades. The related literature can be generally classified into two categories according to whether the methods are directly based on the feature domain or consider some specific statistical feature characteristics. In this paper, we present a polynomial regression approach that has the merit of directly characterizing the relationship between speech features and their corresponding distribution characteristics to compensate for noise interference. The proposed approach and a variant were thoroughly investigated and compared with a few existing noise robustness approaches. All experiments were conducted using the Aurora-2 database and task. The results show that our approaches achieve considerable word error rate reductions over the baseline system and are comparable to most of the conventional robustness approaches discussed in this paper.
Keywords :
polynomials; regression analysis; speech recognition; Aurora-2 database; automatic speech recognition; noise interference; polynomial regression approach; robust speech recognition; speech features; statistical feature characteristics; Automatic speech recognition; Interference; Linear discriminant analysis; Noise level; Noise robustness; Polynomials; Spatial databases; Speech enhancement; Speech recognition; Vectors; Clustering; histogram equalization; polynomial regression; robustness; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.2007612
Filename :
4740142
Link To Document :
بازگشت