• DocumentCode
    452904
  • Title

    Security-Monitoring using Microphone Arrays and Audio Classification

  • Author

    Abu-El-Quran, A.R. ; Goubran, R.A.

  • Author_Institution
    Dept. of Syst. & Comput. Eng., Carleton Univ., Ottawa, Ont.
  • Volume
    2
  • fYear
    2005
  • fDate
    16-19 May 2005
  • Firstpage
    1144
  • Lastpage
    1148
  • Abstract
    This paper proposes a security-monitoring instrument that can detect and classify the location and nature of different sounds in a room. The instrument is reliable and robust even in the presence of reverberation and in low signal to noise ratio conditions. This paper proposes a new algorithm for classifying first an audio segment as speech or nonspeech then classifies the nonspeech audio segment into its own audio type. The algorithm divides an audio segment into frames, estimates the presence of pitch in each frame, and calculates a pitch ratio parameter. This parameter is then used to classify the audio segment. The threshold used in calculating this parameter is adapted to accommodate different environments. Nonspeech audio segment has further classification using time delayed neural network to be classified into it is own type. The performance of the proposed algorithm is evaluated for different signal-to-noise ratios using a library of audio segments. The library includes speech segments and nonspeech segments such as windows breaking and footsteps. Using 0.4 second segments it is shown that the proposed algorithm can achieve an average correct decision for 94.5% of the reverberant library and 95.1% of the nonreverberant library
  • Keywords
    array signal processing; audio signal processing; feature extraction; microphone arrays; neural nets; security; signal classification; signal processing equipment; speech processing; audio classification; audio segment; feature extraction; microphone arrays; nonreverberant library; nonspeech segments; pitch ratio parameter; reverberant library; reverberation; security-monitoring instrument; speech processing; speech segments; time delayed neural network; Acoustic noise; Change detection algorithms; Feature extraction; Instruments; Libraries; Microphone arrays; Noise cancellation; Reverberation; Speech; Working environment noise; Feature extraction; audio classification; beam-forming; microphone arrays; security-monitoring; speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Instrumentation and Measurement Technology Conference, 2005. IMTC 2005. Proceedings of the IEEE
  • Conference_Location
    Ottawa, Ont.
  • Print_ISBN
    0-7803-8879-8
  • Type

    conf

  • DOI
    10.1109/IMTC.2005.1604323
  • Filename
    1604323