DocumentCode
394328
Title
Feature space normalization in adverse acoustic conditions
Author
Molau, Sirko ; Hilger, Florian ; Ney, Hermann
Author_Institution
Comput. Sci. Dept., RWTH, Aachen, Germany
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
We study the effect of different feature space normalization techniques in adverse acoustic conditions. Recognition tests are reported for cepstral mean and variance normalization, histogram normalization, feature space rotation, and vocal tract length normalization on a German isolated word recognition task with large acoustic mismatch. The training data was recorded in a clean office environment and the test data in cars. Speech recognition failed completely without normalization on the highway dataset, whereas the word error rate could be reduced to 17% using an online setup and to 10% with an offline setup.
Keywords
acoustic noise; acoustic signal processing; error statistics; learning (artificial intelligence); natural languages; speech recognition; German isolated word recognition task; acoustic mismatch; adverse acoustic conditions; cepstral mean normalization; cepstral variance normalization; feature space normalization; feature space rotation; histogram normalization; speech recognition; vocal tract length normalization; word error rate; Acoustic testing; Automated highways; Automatic speech recognition; Cepstral analysis; Cities and towns; Histograms; Space technology; Speech recognition; System testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198866
Filename
1198866
Link To Document