DocumentCode :
180326
Title :
Sparse cepstral codes and power scale for instrument identification
Author :
Li-Fan Yu ; Li Su ; Yi-Hsuan Yang
Author_Institution :
Res. Center for Inf. Technol. Innovation, Acad. Sinica, Taipei, Taiwan
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
7460
Lastpage :
7464
Abstract :
This paper presents a novel feature representation, called sparse cepstral codes, for instrument identification. We first motivate the approach by discussing why the cepstrum is suitable for instrument identification. We then propose the use of sparse coding and power normalization to derive compact codes that better represent the information in the cepstrum. Our evaluation on both uni-source and multi-source instrument identification tasks shows that the proposed feature leads to significantly better accuracy than existing methods. We further show that a cepstrum obtained from a power-scaled spectrum can outperform the typical cepstrum, especially for multi-source signals. The proposed system achieves an F-score of 0.955 on the uni-source dataset and 0.688 on the multi-source dataset.
Keywords :
cepstral analysis; encoding; F-score; cepstrum information; compact codes; feature representation; multisource dataset; multisource instrument identification; multisource signal; power normalization; power-scaled spectrum; sparse cepstral codes; uni-source dataset; uni-source instrument identification; Accuracy; Cepstrum; Dictionaries; Instruments; Mel frequency cepstral coefficient; Speech; cepstrum; dictionary learning; instrument identification; power scale; sparse coding;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6855050
Filename :
6855050