Title :
Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal
Author :
Peeters, Geoffroy
Author_Institution :
Sound Anal./Synthesis Team, IRCAM, Paris, France
fDate :
7/1/2011 12:00:00 AM
Abstract :
In this paper, we study the spectral and temporal periodicity representations that can be used to describe the characteristics of the rhythm of a music audio signal. A continuous-valued energy-function representing the onset positions over time is first extracted from the audio signal. From this function we compute at each time a vector which represents the characteristics of the local rhythm. Four feature sets are studied for this vector. They are derived from the amplitude of the discrete Fourier transform (DFT), the auto-correlation function (ACF), the product of the DFT and the ACF interpolated on a hybrid lag/frequency axis and the concatenated DFT and ACF coefficients. Then the vectors are sampled at some specific frequencies, which represent various ratios of the local tempo. The ability of these periodicity representations to describe the rhythm characteristics of an audio item is evaluated through a classification task. In this, we test the use of the periodicity representations alone, combined with tempo information and combined with a proposed set of rhythm features. The evaluation is performed using annotated and estimated tempo. We show that using such simple periodicity representations allows achieving high recognition rates at least comparable to previously published results.
Keywords :
audio signal processing; discrete Fourier transforms; signal classification; autocorrelation function; automatic classification; concatenated ACF coefficient; concatenated DFT coefficient; continuous-valued energy-function; discrete Fourier transform; feature sets; hybrid lag-frequency axis; music audio signal; recognition rates; rhythm characteristics; spectral periodicity representation; tempo information; temporal periodicity representation; Accuracy; Discrete Fourier transforms; Histograms; Multiple signal classification; Rhythm; Speech processing; Audio features; automatic indexing; rhythm classification; rhythm description;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2010.2089452