• DocumentCode
    3151978
  • Title
    Cluster aware normalization for enhancing audio similarity
  • Author
    Lagrange, Mathieu ; Martins, Luis Gustavo ; Tzanetakis, George
  • Author_Institution
    IRCAM, UPMC, Paris, France
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    1969
  • Lastpage
    1972
  • Abstract
    An important task in Music Information Retrieval is content-based similarity retrieval, in which, given a query music track, a set of tracks that are similar in terms of musical content is retrieved. A variety of audio features that attempt to model different aspects of the music have been proposed. In most cases, the resulting audio feature vector used to represent each music track is high-dimensional. It has been observed that high-dimensional music similarity spaces exhibit some anomalies: hubs, which are tracks that are similar to many other tracks, and orphans, which are tracks that are not similar to most other tracks. These anomalies are an artifact of the high-dimensional representation rather than a reflection of the musical content. In this work, we describe a distance normalization method that is shown to reduce the number of hubs and orphans. It post-processes the similarity matrix that encodes the pairwise track similarities and uses clustering to adapt the distance normalization to the local structure of the feature space.
  • Keywords
    audio signal processing; information retrieval; matrix algebra; music; audio feature vector; audio similarity; cluster aware normalization; content-based similarity retrieval; distance normalization; feature space; high dimensional music similarity spaces; high dimensional representation; local structure; music information retrieval; musical content; pairwise track similarities; similarity matrix; Accuracy; Educational institutions; Indexes; Kernel; Measurement; Speech; Vectors; distance normalization; information retrieval; kernel-based clustering;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2012.6288292
  • Filename
    6288292