• DocumentCode
    754239
  • Title

    Analysis of Minimum Distances in High-Dimensional Musical Spaces

  • Author

    Casey, Michael ; Rhodes, Christophe ; Slaney, Malcolm

  • Author_Institution
    Goldsmiths Coll., Dept. of Comput., Univ. of London, London
  • Volume
    16
  • Issue
    5
  • fYear
    2008
  • fDate
    7/1/2008 12:00:00 AM
  • Firstpage
    1015
  • Lastpage
    1028
  • Abstract
    We propose an automatic method for measuring content-based music similarity, enhancing the current generation of music search engines and recommended systems. Many previous approaches to track similarity require brute-force, pair-wise processing between all audio features in a database and therefore are not practical for large collections. However, in an Internet-connected world, where users have access to millions of musical tracks, efficiency is crucial. Our approach uses features extracted from unlabeled audio data and near-neigbor retrieval using a distance threshold, determined by analysis, to solve a range of retrieval tasks. The tasks require temporal features-analogous to the technique of shingling used for text retrieval. To measure similarity, we count pairs of audio shingles, between a query and target track, that are below a distance threshold. The distribution of between-shingle distances is different for each database; therefore, we present an analysis of the distribution of minimum distances between shingles and a method for estimating a distance threshold for optimal retrieval performance. The method is compatible with locality-sensitive hashing (LSH)-allowing implementation with retrieval times several orders of magnitude faster than those using exhaustive distance computations. We evaluate the performance of our proposed method on three contrasting music similarity tasks: retrieval of mis-attributed recordings (fingerprint), retrieval of the same work performed by different artists (cover songs), and retrieval of edited and sampled versions of a query track by remix artists (remixes). Our method achieves near-perfect performance in the first two tasks and 75% precision at 70% recall in the third task. Each task was performed on a test database comprising 4.5 million audio shingles.
  • Keywords
    audio signal processing; feature extraction; information retrieval; Internet-connected world; audio features; automatic method; content-based music similarity; features extraction; high-dimensional musical spaces; locality-sensitive hashing; minimum distances; music search engines; musical tracks; optimal retrieval; pair-wise processing; query track; remix artists; retrieval tasks; test database; text retrieval; Audio databases; Current measurement; Data mining; Feature extraction; Internet; Music information retrieval; Performance evaluation; Search engines; Spatial databases; Target tracking; audio shingles; distance distributions; locality-sensitive hashing; matched-filter distance; music similarity;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.925883
  • Filename
    4544816