Title :
Robust speech music discrimination using spectrum´s first order statistics and neural networks
Author :
Harb, Hadi ; Chen, Liming
Author_Institution :
Dept. of Mathematiques Informatiques, Ecole Centrale de Lyon, Ecully, France
Abstract :
Most of speech/music discrimination techniques proposed in the literature need a great amount of training data in order to provide acceptable results. Besides, they are usually context-dependent. In this paper, we propose a novel technique for speech/music discrimination which relies on first order sound spectrum´s statistics as feature vector and a neural network for classification. Experiments driven on 20000 seconds of various audio data show that the proposed technique has a great ability of generalization since a classification accuracy of 96% has been achieved only after a training phase on 80 seconds audio data. Furthermore, the proposed technique is context-independent as it can be applied to various audio sources.
Keywords :
audio signal processing; music; neural nets; speech processing; statistics; vectors; audio signals; audio sources; first order statistics; neural networks; speech music discrimination; training data; vector; Hidden Markov models; Humans; Indexing; Multiple signal classification; Neural networks; Robustness; Speech; Statistics; Streaming media; Training data;
Conference_Titel :
Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on
Print_ISBN :
0-7803-7946-2
DOI :
10.1109/ISSPA.2003.1224831