DocumentCode
134258
Title
Spectral patch based sparse coding for acoustic event detection
Author
Xugang Lu ; Yu Tsao ; Peng Shen ; Hori, Chiori
Author_Institution
Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan
fYear
2014
fDate
12-14 Sept. 2014
Firstpage
317
Lastpage
320
Abstract
In most algorithms for acoustic event detection (AED), frame based acoustic representations are used in acoustic modeling. Due to lack of context information in feature representation, large model confusions may occur during modeling. We have proposed a feature learning and representation algorithm to explore context information from temporal-frequency patches of signal for AED. With the algorithm, a sparse feature was extracted based on an acoustic dictionary composed of a bag of spectral patches. In our previous algorithm, the feature was obtained based on a definition of Euclidian distance between input signal and acoustic dictionary. In this study, we formulate the sparse feature extraction as l1 regularization in signal reconstruction. The sparsity of the representation is efficiently controlled via varying a regularization parameter. A support vector machine (SVM) classifier was built on the extracted sparse feature for AED. Our experimental results showed that the spectral patch based sparse representation effectively improved the performance by incorporating temporal-frequency context information in modeling.
Keywords
acoustic signal detection; signal reconstruction; speech processing; support vector machines; Euclidian distance; SVM classifier; acoustic dictionary; acoustic event detection; feature learning; feature representation; frame based acoustic representation; regularization parameter; signal reconstruction; sparse coding; sparse feature extraction; spectral patch; support vector machine; temporal-frequency context information; temporal-frequency patches; Acoustics; Dictionaries; Encoding; Event detection; Feature extraction; Hidden Markov models; Support vector machines; Acoustic event detection; sparse coding; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location
Singapore
Type
conf
DOI
10.1109/ISCSLP.2014.6936651
Filename
6936651
Link To Document