DocumentCode
2184720
Title
Cochleagram image feature for improved robustness in sound recognition
Author
Sharan, Roneel V. ; Moir, Tom J.
Author_Institution
School of Engineering, Auckland University of Technology, Private Bag 92006, 1142, New Zealand
fYear
2015
fDate
21-24 July 2015
Firstpage
441
Lastpage
444
Abstract
In this paper, we use the cochleagram image of sound signals for time-frequency analysis and feature extraction, instead of the conventional spectrogram image, in an audio surveillance application. The signal is firstly passed through a gammatone filter which models the auditory filters in the human cochlea. The filtered signal is then divided into small windows and the energy in each window is added and normalized which gives the intensity values of the cochleagram image. We then divide the cochleagram image into blocks and extract central moments as features. Using two feature vector representation methods, the results show significant improvement in overall classification accuracy when compared to results from literature employing similar feature extraction and representation techniques but using spectrogram images. The most improved results were at low signal-to-noise ratios.
Keywords
Image recognition; Image resolution; Noise; Speech; Speech recognition; audio surveillance; central moments; cochleagram; sound recognition; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Signal Processing (DSP), 2015 IEEE International Conference on
Conference_Location
Singapore, Singapore
Type
conf
DOI
10.1109/ICDSP.2015.7251910
Filename
7251910
Link To Document