Title :
Security monitoring using microphone arrays and audio classification
Author :
Abu-El-Quran, Ahmad R. ; Goubran, Rafik A. ; Chan, Adrian D C
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont.
Abstract :
In the paper, the authors propose a security monitoring system that can detect and classify the location and nature of different sounds within a room. This system is reliable and robust even in the presence of reverberation and in low signal-to-noise (SNR) environments. We describe a novel algorithm for audio classification, which, first, classifies an audio segment as speech or nonspeech and, second, classifies nonspeech audio segments into a particular audio type. To classify an audio segment as speech or nonspeech, this algorithm divides the audio segment into frames, estimates the presence of pitch in each frame, and calculates a pitch ratio (PR) parameter; it is this PR parameter that is used to discriminate speech audio segments from nonspeech audio segments. The discerning threshold for the PR parameter is adaptive to accommodate different environments. A time-delayed neural network is employed to further classify nonspeech audio segments into an audio type. The performance of this novel audio classification algorithm is evaluated using a library of audio segments. This library includes both speech segments and nonspeech segments, such as windows breaking and footsteps. Evaluation is performed under different SNR environments, both with and without reverberation. Using 0.4-s audio segments, the proposed algorithm can achieve an average classification accuracy of 94.5% for the reverberant library and 95.1% for the nonreverberant library
Keywords :
acoustic signal detection; acoustic signal processing; array signal processing; microphone arrays; neural nets; reverberation; security; signal classification; audio classification; audio segments; discerning threshold; feature extraction; microphone arrays; nonspeech segments; pitch ratio parameter; security monitoring; speech processing; time-delayed neural network; Classification algorithms; Libraries; Microphone arrays; Monitoring; Neural networks; Performance evaluation; Reverberation; Robustness; Security; Speech; Audio classification; beamforming; feature extraction; microphone arrays; security monitoring; speech processing;
Journal_Title :
Instrumentation and Measurement, IEEE Transactions on
DOI :
10.1109/TIM.2006.876394