DocumentCode
3144587
Title
Music tempo estimation and beat tracking by applying source separation and metrical relations
Author
Gkiokas, Aggelos ; Katsouros, Vassilis ; Carayannis, George ; Stajylakis, Themos
Author_Institution
Inst. for Language & Speech Process. / R.C. “Athena”, Greece
fYear
2012
fDate
25-30 March 2012
Firstpage
421
Lastpage
424
Abstract
In this paper, we present tempo estimation and beat tracking algorithms by utilizing percussive/harmonic separation of the audio signal, in order to extract filterbank energies and chroma features from the respective components. Periodicity analysis is carried out by the convolution of feature sequences with a bank of resonators. Target tempo is estimated from the resulting periodicity vector by incorporating metrical relations knowledge. Tempo estimation is followed by a local tempo refinement method to enhance the beat-tracking algorithm. Beat tracking involves the computation of the beat saliencies derived from the resonators responses and proposes a distance measure between candidate beats locations. A dynamic programming algorithm is adopted to find the optimal “path” of beats. Both tempo estimation and beat tracking methods were submitted on MIREX 2011, while the tempo estimation algorithm was also evaluated on ISMIR 2004 Tempo Induction Evaluation Exchange Dataset.
Keywords
audio signal processing; convolution; dynamic programming; feature extraction; filtering theory; source separation; tracking; ISMIR 2004 Tempo Induction Evaluation Exchange Dataset; MIREX 2011; audio signal; beat tracking; chroma feature extraction; dynamic programming algorithm; feature sequence convolution; filterbank energy extraction; harmonic separation; local tempo refinement method; metrical relations knowledge; music tempo estimation; percussive separation; periodicity analysis; periodicity vector; resonators response; source separation; target tempo estimation; tempo estimation algorithm; Decision support systems; beat tracking; chroma features; periodicity analysis; tempo estimation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6287906
Filename
6287906
Link To Document