• DocumentCode
    3569583
  • Title

    Detection and clustering of musical audio parts using Fisher linear semi-discriminant analysis

  • Author

    Giannakopoulos, Theodoros ; Petridis, Sergios

  • Author_Institution
    Comput. Intell. Lab., NCSR Demokritos, Aghia Paraskevi, Greece
  • fYear
    2012
  • Firstpage
    1289
  • Lastpage
    1293
  • Abstract
    We present a method aiming at facilitating musical audio summarization by organizing the signal into a set of possibly recurring parts, such that inclusion of an excerpt from each part would be adequate to compactly summarize the whole audio signal. Crucial to the success of grouping segments into parts is the underlying distance metric, which depends on the feature space and should provide distances that are low for segments of the same audio part and high for segments of different audio parts. Starting with a general-purpose audio feature space, we use the information from the sequential structure of audio signals to estimate, in a completely unsupervised way, a Fisher subspace with discriminant characteristics for the particular audio signal. The derived feature space is used in a segmentation-clustering system based on fuzzy clustering, HMM and k-NN probability estimation. The experimental results show an almost 10% performance gain when adopting the Fisher subspace with respect to using the original feature space.
  • Keywords
    audio signal processing; estimation theory; fuzzy set theory; hidden Markov models; learning (artificial intelligence); pattern clustering; probability; signal detection; Fisher subspace; Fisher linear semidiscriminant analysis; HMM; audio segmentation; audio signal sequential structure; fuzzy clustering; k-NN probability estimation; musical audio clustering; musical audio detection; musical audio summarization; Clustering algorithms; Entropy; Estimation; Feature extraction; Hidden Markov models; Indexes; Vectors; Fisher discriminant analysis; audio analysis; clustering; music summarisation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6334311