DocumentCode
179733
Title
Generalized Gaussian Distribution Kullback-Leibler kernel for robust sound event recognition
Author
Dat, Tran Huy ; Terence, Ng Wen Zheng ; Dennis, Jonathan William ; Leng Yi Ren
Author_Institution
Human Language Technol. Dept., Inst. for Infocomm Res., Singapore, Singapore
fYear
2014
fDate
4-9 May 2014
Firstpage
5949
Lastpage
5953
Abstract
In previous works, we have developed a spectrogram image feature extraction framework for robust sound event recognition. The basic idea here is to extract useful information from the 2D time-frequency representation of the sound signal to build up specific feature extractions and classifier under noisy conditions. In this paper, we propose a novel robust spectrogram image method where the key is the observed sparsity of the sound spectrogram image in wavelet representations, which is modeled by the Generalized Gaussian Distributions modeling. Furthermore, the Generalized Gaussian Distribution Kullback-Leibler (GGD-KL) kernel SVM is developed to embed the given probabilistic distance into the quadratic programming machine to optimize the classification The experimental result shows the superiority of the proposed method to the previous works and the state-of-the-art in the field.
Keywords
Gaussian distribution; feature extraction; image recognition; quadratic programming; support vector machines; time-frequency analysis; wavelet transforms; 2D time-frequency representation; GGD-KL kernel; SVM; generalized Gaussian distribution Kullback-Leibler kernel; noisy conditions; observed sparsity; probabilistic distance; quadratic programming machine; robust sound event recognition; robust spectrogram image method; sound signal; sound spectrogram image; spectrogram image feature extraction; wavelet representations; Acoustics; Conferences; Decision support systems; Speech; Speech processing; Generalized Gaussian Distribution; Kernel; Kullback-Leiber Distance; Sound Event Recognition; Spectrogram; Wavelet;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854745
Filename
6854745
Link To Document