Title :
Speech — Nonspeech discrimination based on speech-relevant spectrogram modulations
Author :
Wohlmayr, Michael ; Markaki, Maria ; Stylianou, Yannis
Author_Institution :
Comput. Sci. Dept., Univ. of Crete, Heraklion, Greece
Abstract :
In this work, we adopt an information theoretic approach - the Information Bottleneck method - to extract the relevant modulation frequencies across both dimensions of a spectrogram, for speech / non-speech discrimination (music, animal vocalizations, environmental noises). A compact representation is built for each sound ensemble, consisting of the maximally informative features. We demonstrate the effectiveness of a simple thresholding classifier which is based on the similarity of a sound to each characteristic modulation spectrum. When we assess the performance of the classification system at various SNR conditions using F-measure, results are equally good to a recently proposed method based on the same features but having significantly greater complexity.
Keywords :
feature extraction; signal classification; speech processing; F-measure; characteristic modulation spectrum; compact representation; information bottleneck method; information theoretic approach; maximally informative features; modulation frequencies; sound ensemble; spectrogram; speech-non-speech discrimination; thresholding classifier; Complexity theory; Feature extraction; Modulation; Signal to noise ratio; Speech; Speech processing;
Conference_Titel :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6