DocumentCode :
3600341
Title :
Environmental sniffing: noise knowledge estimation for robust speech systems
Author :
Akbacak, Murat ; Hansen, John H L
Author_Institution :
Robust Speech Process. Group, Colorado Univ., Boulder, CO, USA
Volume :
2
fYear :
2003
Abstract :
We propose a framework for extracting knowledge about environmental noise from an input audio sequence and organizing this knowledge for use by other speech systems. To date, most approaches dealing with environmental noise in speech systems are based on assumptions about the noise, or differences in the collection of and training on a specific noise condition, rather than exploring the nature of the noise. We are interested in constructing a new speech framework, entitled environmental sniffing, to detect, classify and track acoustic environmental conditions. The first goal of the framework is to seek out detailed information about the environmental characteristics instead of just detecting environmental changes. The second goal is to organize this knowledge in an effective manner to allow smart decisions to direct other speech systems. Our current framework uses a number of speech processing modules including the Teager energy operator (TEO) and a hybrid algorithm with T2-BIC segmentation, noise language modeling and GMM classification in noise knowledge estimation. We define a new information criterion that incorporates the impact of noise on environmental sniffing performance. We use an in-vehicle speech and noise environment as a test platform for our evaluations and investigate the integration of environmental sniffing into an automatic speech recognition (ASR) engine in this environment. Noise classification experiments show that the hybrid algorithm achieves an error rate of 25.51%, outperforming a baseline system by an absolute 7.08%.
Keywords :
Bayes methods; Gaussian processes; acoustic noise; acoustic signal processing; audio signal processing; decision theory; knowledge acquisition; parameter estimation; signal classification; speech processing; speech recognition; BIC; Bayesian information criterion; GMM classification; Gaussian mixture model; Teager energy operator; acoustic environmental conditions; audio sequence; automatic speech recognition; environmental noise; environmental sniffing; in-vehicle noise; noise classification; noise knowledge estimation; robust speech systems; smart decisions; speech processing modules; Acoustic noise; Acoustic signal detection; Automatic speech recognition; Natural languages; Noise robustness; Organizing; Speech analysis; Speech enhancement; Speech processing; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202307
Filename :
1202307
Link To Document :
بازگشت