• DocumentCode
    3122623
  • Title

    Dominant Audio Descriptors for Audio Classification and Retrieval

  • Author

    Fadeev, Aleksey ; Missaoui, Oualid ; Frigui, Hichem

  • Author_Institution
    CECS, Univ. of Louisville, Louisville, KY, USA
  • fYear
    2009
  • fDate
    13-15 Dec. 2009
  • Firstpage
    75
  • Lastpage
    78
  • Abstract
    In this paper, we propose a new general low-level feature representation for audio signals. Our approach, called Dominant Audio Descriptor is inspired by the MPEG-7 Dominant Color Descriptor. It is based on clustering time-local features and identifying dominant components. The features used to illustrate this approach are the well-known Mel Frequency Cepstral Coefficients. The performance of the proposed framework is evaluated on audio classification and retrieval tasks. In particular, the experiments are performed on a benchmark music data set. The results are compared to those previously obtained on the same data base. We show that our approach improved classification and retrieval results by more then 3%, and for the case of retrieval reached almost perfect retrieval rate of 99:36%. In addition, the paper presents comparative results against several state of the art classifiers, such as Hidden Markov Models, Support Vector Machines and k-Nearest Neighbors.
  • Keywords
    audio signal processing; signal classification; MPEG-7 dominant color descriptor; Mel frequency cepstral coefficients; audio classification; audio retrieval; audio signals; dominant audio descriptors; hidden Markov models; k-nearest neighbors; low-level feature representation; support vector machines; Feature extraction; Filters; Hidden Markov models; Humans; MPEG 7 Standard; Machine learning; Mel frequency cepstral coefficient; Music information retrieval; Support vector machines; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications, 2009. ICMLA '09. International Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    978-0-7695-3926-3
  • Type

    conf

  • DOI
    10.1109/ICMLA.2009.120
  • Filename
    5381799