Title :
Musical audio semantic segmentation exploiting analysis of prominent spectral energy peaks and multi-feature refinement
Author :
Romano, P. ; Prandi, G. ; Sarti, A. ; Tubaro, S.
Author_Institution :
Dipt. di Elettron. e Inf., Politec. di Milano, Milano
Abstract :
In this paper we present a novel hierarchical and scalable three-stage algorithm to effectively perform musical audio semantic segmentation. In the first stage, the energy spectrum of the entire audio track is analyzed to find significant energy textures that may characterize different semantic segments; in the second and third stages, tonal and timbric features are used to refine the segmentation by moving or deleting segment boundaries. Experimental results on a set of 58 songs show that our algorithm is able to attain good semantic segmentation just after the first step, with a precision of 64% and a recall of 96%. After second step the precision increases to 79%; the best precision result is obtained after the third step, where a value of 85% is reached. In this step the minimum average recall value of 92% is obtained.
Keywords :
audio signal processing; music; spectral analysis; energy spectrum; multifeature refinement analysis; musical audio semantic segmentation; spectral analysis; timbric feature; tonal feature; Algorithm design and analysis; Change detection algorithms; Clustering algorithms; Data mining; Hidden Markov models; Performance analysis; Phase detection; Scalability; Spectral analysis; Visualization; Audio Novelty Analysis; Audio Structural Analysis; Semantic Music Segmentation;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959996