Title :
Non-speech audio event detection
Author :
Elo, José Port ; Bugalho, Miguel ; Trancoso, Isabel ; Neto, João ; Abad, Alberto ; Serralheiro, António
Author_Institution :
INESC-ID, Lisboa
Abstract :
Audio event detection is one of the tasks of the European project VIDIVIDEO. This paper focuses on the detection of non-speech events, and as such only searches for events in audio segments that have been previously classified as non-speech. Preliminary experiments with a small corpus of sound effects have shown the potential of this type of corpus for training purposes. This paper describes our experiments with SVM and HMM-based classifiers, using a 290-hour corpus of sound effects. Although we have only built detectors for 15 semantic concepts so far, the method seems easily portable to other concepts. The paper reports experiments with multiple features, different kernels and several analysis windows. Preliminary experiments on documentaries and films yielded promising results, despite the difficulties posed by the mixtures of audio events that characterize real sounds.
Keywords :
audio signal processing; hidden Markov models; pattern classification; signal classification; support vector machines; HMM-based classifier; SVM; VIDIVIDEO; audio event detection; nonspeech audio event detection; Bandwidth; Brightness; Detectors; Event detection; Feature extraction; Hidden Markov models; Principal component analysis; Speech recognition; Support vector machine classification; Support vector machines; audio segmentation; event detection;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959998