• DocumentCode
    1037751
  • Title

    Computational Models of Similarity for Drum Samples

  • Author

    Pampalk, Elias ; Herrera, Perfecto ; Goto, Masataka

  • Author_Institution
    Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba
  • Volume
    16
  • Issue
    2
  • fYear
    2008
  • Firstpage
    408
  • Lastpage
    423
  • Abstract
    In this paper, we optimize and evaluate computational models of similarity for sounds from the same instrument class. We investigate four instrument classes: bass drums, snare drums, high-pitched toms, and low-pitched toms. We evaluate two similarity models: one is defined in the ISO/IEC MPEG-7 standard, and the other is based on auditory images. For the second model, we study the impact of various parameters. We use data from listening tests, and instrument class labels to evaluate the models. Our results show that the model based on auditory images yields a very high average correlation with human similarity ratings and clearly outperforms the MPEG-7 recommendation. The average correlations range from 0.89-0.96 depending on the instrument class. Furthermore, our results indicate that instrument class data can be used as alternative to data from listening tests to evaluate sound similarity models.
  • Keywords
    IEC standards; ISO standards; musical instruments; video coding; ISO-IEC MPEG-7 standard; auditory images; bass drums; computational models; drum samples; high-pitched toms; low-pitched toms; snare drums; sound similarity models; Acoustic testing; Computational modeling; Computer interfaces; Humans; IEC standards; ISO standards; Instruments; Libraries; MPEG 7 Standard; Music information retrieval; Content-based similarity; drum sounds; percussive music instruments;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.910783
  • Filename
    4432650