Spectral patch based sparse coding for acoustic event detection

Author

Xugang Lu ; Yu Tsao ; Peng Shen ; Hori, Chiori

Author_Institution

Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan

fYear

2014

fDate

12-14 Sept. 2014

Firstpage

317

Lastpage

320

Abstract

In most algorithms for acoustic event detection (AED), frame based acoustic representations are used in acoustic modeling. Due to lack of context information in feature representation, large model confusions may occur during modeling. We have proposed a feature learning and representation algorithm to explore context information from temporal-frequency patches of signal for AED. With the algorithm, a sparse feature was extracted based on an acoustic dictionary composed of a bag of spectral patches. In our previous algorithm, the feature was obtained based on a definition of Euclidian distance between input signal and acoustic dictionary. In this study, we formulate the sparse feature extraction as l₁ regularization in signal reconstruction. The sparsity of the representation is efficiently controlled via varying a regularization parameter. A support vector machine (SVM) classifier was built on the extracted sparse feature for AED. Our experimental results showed that the spectral patch based sparse representation effectively improved the performance by incorporating temporal-frequency context information in modeling.

Keywords

acoustic signal detection; signal reconstruction; speech processing; support vector machines; Euclidian distance; SVM classifier; acoustic dictionary; acoustic event detection; feature learning; feature representation; frame based acoustic representation; regularization parameter; signal reconstruction; sparse coding; sparse feature extraction; spectral patch; support vector machine; temporal-frequency context information; temporal-frequency patches; Acoustics; Dictionaries; Encoding; Event detection; Feature extraction; Hidden Markov models; Support vector machines; Acoustic event detection; sparse coding; support vector machine;

fLanguage

English

Publisher

ieee

Conference_Titel

Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on

Conference_Location

Singapore

Type

conf

DOI

10.1109/ISCSLP.2014.6936651

Filename

6936651