DocumentCode :
134258
Title :
Spectral patch based sparse coding for acoustic event detection
Author :
Xugang Lu ; Yu Tsao ; Peng Shen ; Hori, Chiori
Author_Institution :
Nat. Inst. of Inf. & Commun. Technol., Koganei, Japan
fYear :
2014
fDate :
12-14 Sept. 2014
Firstpage :
317
Lastpage :
320
Abstract :
In most algorithms for acoustic event detection (AED), frame based acoustic representations are used in acoustic modeling. Due to lack of context information in feature representation, large model confusions may occur during modeling. We have proposed a feature learning and representation algorithm to explore context information from temporal-frequency patches of signal for AED. With the algorithm, a sparse feature was extracted based on an acoustic dictionary composed of a bag of spectral patches. In our previous algorithm, the feature was obtained based on a definition of Euclidian distance between input signal and acoustic dictionary. In this study, we formulate the sparse feature extraction as l1 regularization in signal reconstruction. The sparsity of the representation is efficiently controlled via varying a regularization parameter. A support vector machine (SVM) classifier was built on the extracted sparse feature for AED. Our experimental results showed that the spectral patch based sparse representation effectively improved the performance by incorporating temporal-frequency context information in modeling.
Keywords :
acoustic signal detection; signal reconstruction; speech processing; support vector machines; Euclidian distance; SVM classifier; acoustic dictionary; acoustic event detection; feature learning; feature representation; frame based acoustic representation; regularization parameter; signal reconstruction; sparse coding; sparse feature extraction; spectral patch; support vector machine; temporal-frequency context information; temporal-frequency patches; Acoustics; Dictionaries; Encoding; Event detection; Feature extraction; Hidden Markov models; Support vector machines; Acoustic event detection; sparse coding; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
Type :
conf
DOI :
10.1109/ISCSLP.2014.6936651
Filename :
6936651
Link To Document :
بازگشت