Title :
Rhythm detection for speech-music discrimination in MPEG compressed domain
Author :
Jarina, Roinan ; O´Connor, Noel ; Marlow, Seán ; Murphy, Noel
Author_Institution :
Centre for Digital Video Process., Dublin City Univ., Ireland
Abstract :
A novel approach to speech-music discrimination based on rhythm (or beat) detection is introduced. Rhythmic pulses are detected by applying a long-term autocorrelation method on band-passed signals. This approach is combined with another, in which the features describe the energy peaks of the signal. The discriminator uses just three features that are computed from data directly taken from an MPEG-1 bitstream. The discriminator was tested on more than 3 hours of audio data. Average recognition rate is 97.7%.
Keywords :
audio signal processing; correlation methods; feature extraction; music; speech processing; MPEG-1 bitstream; audio data; band-passed signals; beat detection; energy peaks; long-term autocorrelation method; recognition rate; rhythm; rhythmic pulses; speech-music discrimination; Artificial neural networks; Autocorrelation; Hidden Markov models; Multiple signal classification; Rhythm; Signal processing; Speech; Testing; Transform coding; Video compression;
Conference_Titel :
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN :
0-7803-7503-3
DOI :
10.1109/ICDSP.2002.1027851