DocumentCode
3330205
Title
Modeling sub-band correlation for noise-robust speech recognition
Author
McAuley, James ; Ming, Ji ; Hanna, Philip ; Stewart, Darryl
Author_Institution
Sch. of Comput. Sci., Queen´´s Univ., Belfast, UK
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper investigates the effect of modeling sub-band correlation for noisy speech recognition. Sub-band data streams are assumed to be independent in many sub-band based speech recognition systems. However, the structure and operation of the human vocal tract suggests this assumption is unrealistic. A novel method is proposed to incorporate correlation into sub-band speech feature streams. All possible combinations of sub-bands are created and each combination is treated as a single frequency band by calculating a single feature vector for it. The resulting feature vectors capture information about every band in the combination as well as the dependency across the bands. Experiments conducted on the TIDigits database demonstrate significantly improved robustness in comparison to an independent sub-band system in the presence of both stationary and non-stationary noise.
Keywords
acoustic noise; cepstral analysis; channel bank filters; correlation methods; discrete cosine transforms; random noise; speech recognition; DCT; TIDigits database; discrete cosine transform; feature vector; filter-bank cepstral analysis; human vocal tract; noise-robust speech recognition; nonstationary noise; speech feature streams; stationary noise; sub-band correlation; Cepstral analysis; Computer science; Discrete cosine transforms; Feature extraction; Frequency; Human voice; Noise robustness; Spatial databases; Speaker recognition; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326161
Filename
1326161
Link To Document