DocumentCode :
1038732
Title :
Security monitoring using microphone arrays and audio classification
Author :
Abu-El-Quran, Ahmad R. ; Goubran, Rafik A. ; Chan, Adrian D C
Author_Institution :
Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont.
Volume :
55
Issue :
4
fYear :
2006
Firstpage :
1025
Lastpage :
1032
Abstract :
In the paper, the authors propose a security monitoring system that can detect and classify the location and nature of different sounds within a room. This system is reliable and robust even in the presence of reverberation and in low signal-to-noise (SNR) environments. We describe a novel algorithm for audio classification, which, first, classifies an audio segment as speech or nonspeech and, second, classifies nonspeech audio segments into a particular audio type. To classify an audio segment as speech or nonspeech, this algorithm divides the audio segment into frames, estimates the presence of pitch in each frame, and calculates a pitch ratio (PR) parameter; it is this PR parameter that is used to discriminate speech audio segments from nonspeech audio segments. The discerning threshold for the PR parameter is adaptive to accommodate different environments. A time-delayed neural network is employed to further classify nonspeech audio segments into an audio type. The performance of this novel audio classification algorithm is evaluated using a library of audio segments. This library includes both speech segments and nonspeech segments, such as windows breaking and footsteps. Evaluation is performed under different SNR environments, both with and without reverberation. Using 0.4-s audio segments, the proposed algorithm can achieve an average classification accuracy of 94.5% for the reverberant library and 95.1% for the nonreverberant library
Keywords :
acoustic signal detection; acoustic signal processing; array signal processing; microphone arrays; neural nets; reverberation; security; signal classification; audio classification; audio segments; discerning threshold; feature extraction; microphone arrays; nonspeech segments; pitch ratio parameter; security monitoring; speech processing; time-delayed neural network; Classification algorithms; Libraries; Microphone arrays; Monitoring; Neural networks; Performance evaluation; Reverberation; Robustness; Security; Speech; Audio classification; beamforming; feature extraction; microphone arrays; security monitoring; speech processing;
fLanguage :
English
Journal_Title :
Instrumentation and Measurement, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9456
Type :
jour
DOI :
10.1109/TIM.2006.876394
Filename :
1658350
Link To Document :
بازگشت