Title :
Gender identification using a general audio classifier
Author :
Harb, Hadi ; Chen, Liming
Author_Institution :
Dept. of Mathematiques Informatique, Ecole Centrale de Lyon, France
Abstract :
In content-based multimedia indexing, gender identification from the speech signal is an important task. Existing techniques depend on the quality of the speech signal, which makes them unsuitable for video indexing problems. In this paper we introduce a novel gender identification approach based on a general audio classifier. The audio classifier models the audio signal by the first-order statistics of its spectrum in 1 s windows and uses a set of neural networks as classifiers. The presented technique is robust to adverse audio compression and is language independent. We show how practical considerations about the speech in audio-visual data, such as the continuity of speech, can further improve the classification results, which attain 92%.
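The windowed modelling step described in the abstract could be sketched roughly as follows. This is an illustration only, not the authors' implementation: it assumes "first-order statistics" means the per-bin mean and variance of the spectral frames inside each window, and the function name window_statistics, the parameter frames_per_window, and the toy input values are all hypothetical. The neural-network classifiers that would consume these features are not shown.

```python
from statistics import fmean, pvariance


def window_statistics(frames, frames_per_window):
    """Group spectral frames into fixed-length windows and return, for each
    window, the per-bin means followed by the per-bin variances.

    Assumption: mean and variance stand in for the "first-order statistics"
    mentioned in the abstract; the paper may define them differently.
    """
    features = []
    # Step through complete windows only; a trailing partial window is dropped.
    for start in range(0, len(frames) - frames_per_window + 1, frames_per_window):
        window = frames[start:start + frames_per_window]
        bins = list(zip(*window))  # transpose: one tuple of values per frequency bin
        means = [fmean(b) for b in bins]
        variances = [pvariance(b) for b in bins]
        features.append(means + variances)
    return features


# Toy input (hypothetical values): 4 frames of a 2-bin spectrum,
# with 2 frames standing in for one 1 s window.
frames = [[1.0, 2.0], [3.0, 2.0], [0.0, 4.0], [0.0, 8.0]]
features = window_statistics(frames, frames_per_window=2)
```

In a real setting, frames_per_window would be chosen so that one window spans about 1 s of audio at the frame rate in use, and each feature vector would be passed to a classifier.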
Keywords :
audio signal processing; indexing; neural nets; spectral analysis; speech recognition; audio classifier; audio compression; audio-visual data; content-based multimedia indexing; gender identification; neural networks; spectrum statistics; speech signal; Automatic speech recognition; Context modeling; Mel frequency cepstral coefficient; Robustness; Signal processing; Statistics
Conference_Title :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221721