DocumentCode :
179855
Title :
Sparse representation based on a bag of spectral exemplars for acoustic event detection
Author :
Xugang Lu ; Yu Tsao ; Matsuda, Shodai ; Hori, Chiori
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
6255
Lastpage :
6259
Abstract :
Acoustic event detection is an important step for audio content analysis and retrieval. Traditional detection techniques model the acoustic events on frame-based spectral features. Considering the temporal-frequency structures of acoustic events may be distributed in time-scales beyond frames, we propose to represent those structures as a bag of spectral patch exemplars. In order to learn the representative exemplars, k-means clustering based vector quantization (VQ) was applied on the whitened spectral patches which makes the learned exemplars focus on high-order statistical structure. With the learned spectral exemplars, a sparse feature representation is extracted based on the similarity measurement to the learned exemplars. A support vector machine (SVM) classifier was built on the sparse representation for acoustic event detection. Our experimental results showed that the sparse representation based on the patch based exemplars significantly improved the performance compared with traditional frame based representations.
Keywords :
acoustic transducers; audio streaming; feature extraction; higher order statistics; support vector machines; vector quantisation; SVM classifier; VQ; acoustic event detection; audio content analysis; audio content retrieval; audio stream; detection techniques model; frame-based spectral features; high-order statistical structure; k-means clustering based vector quantization; learned exemplars; sparse feature representation extraction; spectral patch exemplars. bag; support vector machine classifier; temporal-frequency structures; Acoustics; Event detection; Feature extraction; Hidden Markov models; Speech; Support vector machines; Vectors; Sparse representation; acoustic event detection; support vector machine; vector quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854807
Filename :
6854807
Link To Document :
بازگشت