Title :
Fusion of descriptors for speech / music classification
Author :
Mauclair, Julie ; Pinquier, Julien
Author_Institution :
Lab. d´Inf., Univ. du Maine, Le Mans, France
Abstract :
This work addresses the soundtrack indexing of multimedia documents. We present a speech/music classification system based on three original features: entropy modulation, stationary segment duration and number of segments. They were merged by basic score maximisation with the classical 4 Hertz modulation energy. We validate this fusion approach with the use of the probability theory and the evidence theory. The system is tested on radio corpora. Systems are simple, robust and could be improved on every corpus without training or adaptation.
Keywords :
document handling; entropy; indexing; inference mechanisms; multimedia computing; music; probability; sensor fusion; signal classification; speech processing; basic score maximisation; descriptor fusion; entropy modulation; evidence theory; modulation energy; multimedia documents; music classification system; number-of-segments; probability theory; radio corpora; soundtrack indexing; speech classification system; stationary segment duration; Abstracts; Entropy; Filtering theory; Reliability; Speech;
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7