Dominant Audio Descriptors for Audio Classification and Retrieval

Author

Fadeev, Aleksey ; Missaoui, Oualid ; Frigui, Hichem

Author_Institution

CECS, Univ. of Louisville, Louisville, KY, USA

fYear

2009

fDate

13-15 Dec. 2009

Firstpage

75

Lastpage

78

Abstract

In this paper, we propose a new general low-level feature representation for audio signals. Our approach, called Dominant Audio Descriptor is inspired by the MPEG-7 Dominant Color Descriptor. It is based on clustering time-local features and identifying dominant components. The features used to illustrate this approach are the well-known Mel Frequency Cepstral Coefficients. The performance of the proposed framework is evaluated on audio classification and retrieval tasks. In particular, the experiments are performed on a benchmark music data set. The results are compared to those previously obtained on the same data base. We show that our approach improved classification and retrieval results by more then 3%, and for the case of retrieval reached almost perfect retrieval rate of 99:36%. In addition, the paper presents comparative results against several state of the art classifiers, such as Hidden Markov Models, Support Vector Machines and k-Nearest Neighbors.

Keywords

audio signal processing; signal classification; MPEG-7 dominant color descriptor; Mel frequency cepstral coefficients; audio classification; audio retrieval; audio signals; dominant audio descriptors; hidden Markov models; k-nearest neighbors; low-level feature representation; support vector machines; Feature extraction; Filters; Hidden Markov models; Humans; MPEG 7 Standard; Machine learning; Mel frequency cepstral coefficient; Music information retrieval; Support vector machines; Time frequency analysis;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Applications, 2009. ICMLA '09. International Conference on

Conference_Location

Miami Beach, FL

Print_ISBN

978-0-7695-3926-3

Type

conf

DOI

10.1109/ICMLA.2009.120

Filename

5381799