Title :
RBF models for detection of human speech in structured noise
Author :
Hoyt, John D. ; Wechsler, Harry
Author_Institution :
Eng. Res. Facility, Federal Bureau of Investigation, Quantico, VA, USA
fDate :
27 Jun-2 Jul 1994
Abstract :
This paper describes research to develop an efficient system that provides a binary decision as to the presence of speech in a short (one to three second) time sample of an acoustic signal. A method which is efficient and reliably detects human speech in the presence of structured noise (such as wind, music, traffic sounds, etc.) is described. There are methods which work well to detect speech in a communications environment, but previous methods can not distinguish speech from quasi-periodic signal that have a spectral power density similar to speech (such as music). Two separate feature sets are evaluated. Reliable detection is obtained down to signal to noise ratios (SNR) as low as 0 dB. The algorithm utilized is a statistical pattern classifier utilizing radial basis function (RBF) networks. Mel-cepstra and wavelet feature vectors are compared. A method of obtaining temporal feature information is described
Keywords :
feedforward neural nets; signal detection; speech processing; acoustic signal; binary decision; feature sets; human speech detection; mel-cepstra; music; quasi-periodic signal; radial basis function networks; spectral power density; statistical pattern classifier; structured noise; temporal feature information; traffic sounds; wavelet feature vectors; wind; Acoustic noise; Acoustic signal detection; Humans; Multiple signal classification; Music; Power system reliability; Signal to noise ratio; Speech enhancement; Telecommunication network reliability; Working environment noise;
Conference_Titel :
Neural Networks, 1994. IEEE World Congress on Computational Intelligence., 1994 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-1901-X
DOI :
10.1109/ICNN.1994.374996