مرکز منطقه ای اطلاع رساني علوم و فناوري - Sparse representation based on a bag of spectral exemplars for acoustic event detection

DocumentCode :

179855

Title :

Sparse representation based on a bag of spectral exemplars for acoustic event detection

Author :

Xugang Lu ; Yu Tsao ; Matsuda, Shodai ; Hori, Chiori

Author_Institution :

Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

6255

Lastpage :

6259

Abstract :

Acoustic event detection is an important step for audio content analysis and retrieval. Traditional detection techniques model the acoustic events on frame-based spectral features. Considering the temporal-frequency structures of acoustic events may be distributed in time-scales beyond frames, we propose to represent those structures as a bag of spectral patch exemplars. In order to learn the representative exemplars, k-means clustering based vector quantization (VQ) was applied on the whitened spectral patches which makes the learned exemplars focus on high-order statistical structure. With the learned spectral exemplars, a sparse feature representation is extracted based on the similarity measurement to the learned exemplars. A support vector machine (SVM) classifier was built on the sparse representation for acoustic event detection. Our experimental results showed that the sparse representation based on the patch based exemplars significantly improved the performance compared with traditional frame based representations.

Keywords :

acoustic transducers; audio streaming; feature extraction; higher order statistics; support vector machines; vector quantisation; SVM classifier; VQ; acoustic event detection; audio content analysis; audio content retrieval; audio stream; detection techniques model; frame-based spectral features; high-order statistical structure; k-means clustering based vector quantization; learned exemplars; sparse feature representation extraction; spectral patch exemplars. bag; support vector machine classifier; temporal-frequency structures; Acoustics; Event detection; Feature extraction; Hidden Markov models; Speech; Support vector machines; Vectors; Sparse representation; acoustic event detection; support vector machine; vector quantization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854807

Filename :

6854807

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=179855