DocumentCode :
699492
Title :
Fusion of descriptors for speech / music classification
Author :
Mauclair, Julie ; Pinquier, Julien
Author_Institution :
Lab. d'Inf., Univ. du Maine, Le Mans, France
fYear :
2004
fDate :
6-10 Sept. 2004
Firstpage :
1285
Lastpage :
1288
Abstract :
This work addresses soundtrack indexing of multimedia documents. We present a speech/music classification system based on three original features: entropy modulation, stationary segment duration and number of segments. These are merged with the classical 4 Hz modulation energy by basic score maximisation. We validate this fusion approach using probability theory and evidence theory. The system is tested on radio corpora. Systems are simple, robust and could be improved on every corpus without training or adaptation.
Keywords :
document handling; entropy; indexing; inference mechanisms; multimedia computing; music; probability; sensor fusion; signal classification; speech processing; basic score maximisation; descriptor fusion; entropy modulation; evidence theory; modulation energy; multimedia documents; music classification system; number-of-segments; probability theory; radio corpora; soundtrack indexing; speech classification system; stationary segment duration; Abstracts; Entropy; Filtering theory; Reliability; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Signal Processing Conference, 2004 12th European
Conference_Location :
Vienna
Print_ISBN :
978-320-0001-65-7
Type :
conf
Filename :
7080022