DocumentCode :
3413080
Title :
Acoustic feature extraction by tensor-based sparse representation for sound effects classification
Author :
Xueyuan Zhang ; Qianhua He ; Xiaohui Feng
Author_Institution :
Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
166
Lastpage :
170
Abstract :
This paper describes a method to extract time-frequency (TF) audio features by tensor-based sparse approximation for sound effects classification. In the proposed method, the observed data is encoded as a higher-order tensor and discriminative features are extracted in spectrotemporal domain. Firstly, audio signals are represented by a joint time-frequency-duration tensor based on sparse approximation; then tensor factorization is applied to calculate feature vectors. The three arrays of the proposed tensor are used to represent frequency, time and duration of transient TF atoms respectively. Experimental results show that exploiting tensor representation allows to characterize distinctive transient TF atoms, yielding an average accuracy improvement of 9.7% and 12.5% compared with matching pursuit (MP) and MFCC features.
Keywords :
acoustic signal processing; approximation theory; feature extraction; signal representation; tensors; time-frequency analysis; acoustic feature extraction; joint time-frequency-duration tensor; sound effects classification; sparse approximation; tensor factorization; tensor-based sparse representation; time-frequency audio feature extraction; Approximation methods; Atomic clocks; Dictionaries; Feature extraction; Rivers; Speech; Tensile stress; sound classification; sparse approximation; tensor factorization; time-frequency features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7177953
Filename :
7177953
Link To Document :
بازگشت