DocumentCode :
178620
Title :
Acoustic feature extraction by statistics based local binary pattern for environmental sound classification
Author :
Kobayashi, Takehiko ; Ye, John
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
3052
Lastpage :
3056
Abstract :
Classification of environmental sounds is a fundamental procedure for a wide range of real-world applications. In this paper, we propose a novel acoustic feature extraction method for classifying the environmental sounds. The proposed method is motivated from the image processing technique, local binary pattern (LBP), and works on a spectrogram which forms two-dimensional (time-frequency) data like an image. Since the spectrogram contains noisy pixel values, for improving classification performance, it is crucial to extract the features which are robust to the fluctuations in pixel values. We effectively incorporate the local statistics, mean and standard deviation on local pixels, to establish robust LBP. In addition, we provide the technique of L2-Hellinger normalization which is efficiently applied to the proposed features so as to further enhance the discriminative power while increasing the robustness. In the experiments on environmental sound classification using RWCP dataset that contains 105 sound categories, the proposed method produces the superior performance (98.62%) compared to the other methods, exhibiting significant improvements over the standard LBP method as well as robustness to noise and low computation time.
Keywords :
acoustic noise; feature extraction; image processing; pattern classification; L2-Hellinger normalization; RWCP dataset; acoustic feature extraction; environmental sound classification; image processing technique; local binary pattern; local statistics; mean deviation; noisy pixel values; spectrogram; standard deviation; time-frequency data; Acoustics; Feature extraction; Noise; Robustness; Spectrogram; Standards; Time-frequency analysis; classification; environmental sound; local binary pattern; spectrogram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854161
Filename :
6854161
Link To Document :
بازگشت