DocumentCode
178620
Title
Acoustic feature extraction by statistics based local binary pattern for environmental sound classification
Author
Kobayashi, Takehiko ; Ye, John
Author_Institution
Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
fYear
2014
fDate
4-9 May 2014
Firstpage
3052
Lastpage
3056
Abstract
Classification of environmental sounds is a fundamental procedure for a wide range of real-world applications. In this paper, we propose a novel acoustic feature extraction method for classifying the environmental sounds. The proposed method is motivated from the image processing technique, local binary pattern (LBP), and works on a spectrogram which forms two-dimensional (time-frequency) data like an image. Since the spectrogram contains noisy pixel values, for improving classification performance, it is crucial to extract the features which are robust to the fluctuations in pixel values. We effectively incorporate the local statistics, mean and standard deviation on local pixels, to establish robust LBP. In addition, we provide the technique of L2-Hellinger normalization which is efficiently applied to the proposed features so as to further enhance the discriminative power while increasing the robustness. In the experiments on environmental sound classification using RWCP dataset that contains 105 sound categories, the proposed method produces the superior performance (98.62%) compared to the other methods, exhibiting significant improvements over the standard LBP method as well as robustness to noise and low computation time.
Keywords
acoustic noise; feature extraction; image processing; pattern classification; L2-Hellinger normalization; RWCP dataset; acoustic feature extraction; environmental sound classification; image processing technique; local binary pattern; local statistics; mean deviation; noisy pixel values; spectrogram; standard deviation; time-frequency data; Acoustics; Feature extraction; Noise; Robustness; Spectrogram; Standards; Time-frequency analysis; classification; environmental sound; local binary pattern; spectrogram;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854161
Filename
6854161
Link To Document