DocumentCode
3413080
Title
Acoustic feature extraction by tensor-based sparse representation for sound effects classification
Author
Xueyuan Zhang ; Qianhua He ; Xiaohui Feng
Author_Institution
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear
2015
fDate
19-24 April 2015
Firstpage
166
Lastpage
170
Abstract
This paper describes a method to extract time-frequency (TF) audio features by tensor-based sparse approximation for sound effects classification. In the proposed method, the observed data is encoded as a higher-order tensor and discriminative features are extracted in spectrotemporal domain. Firstly, audio signals are represented by a joint time-frequency-duration tensor based on sparse approximation; then tensor factorization is applied to calculate feature vectors. The three arrays of the proposed tensor are used to represent frequency, time and duration of transient TF atoms respectively. Experimental results show that exploiting tensor representation allows to characterize distinctive transient TF atoms, yielding an average accuracy improvement of 9.7% and 12.5% compared with matching pursuit (MP) and MFCC features.
Keywords
acoustic signal processing; approximation theory; feature extraction; signal representation; tensors; time-frequency analysis; acoustic feature extraction; joint time-frequency-duration tensor; sound effects classification; sparse approximation; tensor factorization; tensor-based sparse representation; time-frequency audio feature extraction; Approximation methods; Atomic clocks; Dictionaries; Feature extraction; Rivers; Speech; Tensile stress; sound classification; sparse approximation; tensor factorization; time-frequency features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7177953
Filename
7177953
Link To Document