Title :
On the use of the tempogram to describe audio content and its application to music structural segmentation
Author :
Tian, Mi ; Fazekas, Gyorgy ; Black, Dawn A. A. ; Sandler, Mark
Author_Institution :
Centre for Digital Music, Queen Mary, University of London, London, UK
Abstract :
This paper presents a new set of audio features that describe music content based on tempo cues. The tempogram, a mid-level representation of tempo information, is constructed to characterize tempo variation and local pulse in the audio signal. We introduce a collection of novel tempogram-based features inspired by musicological hypotheses about the relation between music structure and the rhythmic components prominent at different metrical levels. The strength of these features is demonstrated in music structural segmentation, an important task in music information retrieval (MIR), using several published popular music datasets. Results indicate that incorporating tempo information into audio segmentation is a promising new direction.
Keywords :
acoustic signal processing; audio signal processing; music; music information retrieval; audio music content; audio segmentation; music structural segmentation; music structure; musicological hypotheses; rhythmic components; tempo cues; tempo information; tempo variation; tempogram; feature extraction; rhythm; speech; speech processing; music segmentation; rhythm feature extraction
Conference_Title :
2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference_Location :
South Brisbane, QLD, Australia
DOI :
10.1109/ICASSP.2015.7178003