Acoustic feature extraction by tensor-based sparse representation for sound effects classification

Author

Xueyuan Zhang ; Qianhua He ; Xiaohui Feng

Author_Institution

Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China

fYear

2015

fDate

19-24 April 2015

Firstpage

166

Lastpage

170

Abstract

This paper describes a method to extract time-frequency (TF) audio features by tensor-based sparse approximation for sound effects classification. In the proposed method, the observed data is encoded as a higher-order tensor and discriminative features are extracted in spectrotemporal domain. Firstly, audio signals are represented by a joint time-frequency-duration tensor based on sparse approximation; then tensor factorization is applied to calculate feature vectors. The three arrays of the proposed tensor are used to represent frequency, time and duration of transient TF atoms respectively. Experimental results show that exploiting tensor representation allows to characterize distinctive transient TF atoms, yielding an average accuracy improvement of 9.7% and 12.5% compared with matching pursuit (MP) and MFCC features.

Keywords

acoustic signal processing; approximation theory; feature extraction; signal representation; tensors; time-frequency analysis; acoustic feature extraction; joint time-frequency-duration tensor; sound effects classification; sparse approximation; tensor factorization; tensor-based sparse representation; time-frequency audio feature extraction; Approximation methods; Atomic clocks; Dictionaries; Feature extraction; Rivers; Speech; Tensile stress; sound classification; sparse approximation; tensor factorization; time-frequency features;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7177953

Filename

7177953