DocumentCode :
3414018
Title :
On the use of the tempogram to describe audio content and its application to music structural segmentation
Author :
Tian, Mi ; Fazekas, Gyorgy ; Black, Dawn A. A. ; Sandler, Mark
Author_Institution :
Centre for Digital Music, Queen Mary, Univ. of London, London, UK
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
419
Lastpage :
423
Abstract :
This paper presents a new set of audio features to describe music content based on tempo cues. The tempogram, a mid-level representation of tempo information, is constructed to characterize tempo variation and local pulse in the audio signal. We introduce a collection of novel tempogram-based features inspired by musicological hypotheses about the relation between music structure and its rhythmic components prominent at different metrical levels. The strength of these features is demonstrated in music structural segmentation, an important task in music information retrieval (MIR), using several published popular music datasets. Results indicate that incorporating tempo information into audio segmentation is a promising new direction.
Keywords :
acoustic signal processing; audio signal processing; music; Music information retrieval; audio music content; audio segmentation; music structural segmentation; music structure; musicological hypotheses; rhythmic components; tempo cues; tempo information; tempo variation; tempogram; Feature extraction; Music information retrieval; Rhythm; Speech; Speech processing; Audio signal processing; music segmentation; rhythm feature extraction; tempogram;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178003
Filename :
7178003