• DocumentCode
    2184720
  • Title

    Cochleagram image feature for improved robustness in sound recognition

  • Author

    Sharan, Roneel V. ; Moir, Tom J.

  • Author_Institution
    School of Engineering, Auckland University of Technology, Private Bag 92006, 1142, New Zealand
  • fYear
    2015
  • fDate
    21-24 July 2015
  • Firstpage
    441
  • Lastpage
    444
  • Abstract
    In this paper, we use the cochleagram image of sound signals for time-frequency analysis and feature extraction, instead of the conventional spectrogram image, in an audio surveillance application. The signal is firstly passed through a gammatone filter which models the auditory filters in the human cochlea. The filtered signal is then divided into small windows and the energy in each window is added and normalized which gives the intensity values of the cochleagram image. We then divide the cochleagram image into blocks and extract central moments as features. Using two feature vector representation methods, the results show significant improvement in overall classification accuracy when compared to results from literature employing similar feature extraction and representation techniques but using spectrogram images. The most improved results were at low signal-to-noise ratios.
  • Keywords
    Image recognition; Image resolution; Noise; Speech; Speech recognition; audio surveillance; central moments; cochleagram; sound recognition; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing (DSP), 2015 IEEE International Conference on
  • Conference_Location
    Singapore, Singapore
  • Type

    conf

  • DOI
    10.1109/ICDSP.2015.7251910
  • Filename
    7251910