Title :
Detection and clustering of musical audio parts using Fisher linear semi-discriminant analysis
Author :
Giannakopoulos, Theodoros ; Petridis, Sergios
Author_Institution :
Comput. Intell. Lab., NCSR Demokritos, Aghia Paraskevi, Greece
Abstract :
We present a method aiming at facilitating musical audio summarization by organizing the signal into a set of possibly recurring parts, such that inclusion of an expert from each part would be adequate to compactly summarize the whole audio signal. Crucial to the success of the grouping segments into parts is the underlying distance metric, which depends on the feature space and should provide distances that are low for segments of the same audio part and high for segments of different audio parts. Starting with a general purpose audio feature space, we use the information from the sequential structure of audio signals, in order to estimate in a completely unsupervised way a Fischer subspace with discriminant characteristics for the particular audio signal. The derived feature space is used in a segmentation-clustering system based on fuzzy clustering, HMM and k-NN probability estimation. The experimental results show an almost 10% performance gain when adopting the Fisher subspace with respect to using the original feature space.
Keywords :
audio signal processing; estimation theory; fuzzy set theory; hidden Markov models; learning (artificial intelligence); pattern clustering; probability; signal detection; Fischer subspace; Fisher linear semidiscriminant analysis; HMM; audio segmentation; audio signal sequential structure; fuzzy clustering; k-NN probability estimation; musical audio clustering; musical audio detection; musical audio summarization; Clustering algorithms; Entropy; Estimation; Feature extraction; Hidden Markov models; Indexes; Vectors; Fischer discriminant analysis; audio analysis; clustering; music summarisation;
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Print_ISBN :
978-1-4673-1068-0