مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speech music discrimination using spectrum´s first order statistics and neural networks

DocumentCode :

1943576

Title :

Robust speech music discrimination using spectrum´s first order statistics and neural networks

Author :

Harb, Hadi ; Chen, Liming

Author_Institution :

Dept. of Mathematiques Informatiques, Ecole Centrale de Lyon, Ecully, France

Volume :

fYear :

2003

fDate :

1-4 July 2003

Firstpage :

125

Abstract :

Most of speech/music discrimination techniques proposed in the literature need a great amount of training data in order to provide acceptable results. Besides, they are usually context-dependent. In this paper, we propose a novel technique for speech/music discrimination which relies on first order sound spectrum´s statistics as feature vector and a neural network for classification. Experiments driven on 20000 seconds of various audio data show that the proposed technique has a great ability of generalization since a classification accuracy of 96% has been achieved only after a training phase on 80 seconds audio data. Furthermore, the proposed technique is context-independent as it can be applied to various audio sources.

Keywords :

audio signal processing; music; neural nets; speech processing; statistics; vectors; audio signals; audio sources; first order statistics; neural networks; speech music discrimination; training data; vector; Hidden Markov models; Humans; Indexing; Multiple signal classification; Neural networks; Robustness; Speech; Statistics; Streaming media; Training data;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on

Print_ISBN :

0-7803-7946-2

Type :

conf

DOI :

10.1109/ISSPA.2003.1224831

Filename :

1224831

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1943576