Title :
Audiovisual vocal outburst classification in noisy acoustic conditions
Author :
Eyben, Florian ; Petridis, Stavros ; Schuller, Björn ; Pantic, Maja
Author_Institution :
Inst. for Human-Machine-Commun., Tech. Univ. Munchen, Munich, Germany
Abstract :
In this study, we investigate an audiovisual approach for classification of vocal outbursts (non-linguistic vocalisations) in noisy conditions using Long Short-Term Memory (LSTM) Recurrent Neural Networks and Support Vector Machines. Fusion of geometric shape features and acoustic low-level descriptors is performed on the feature level. Three different types of acoustic noise are considered: babble, office and street noise. Experiments are conducted on every noise type to asses the benefit of the fusion in each case. As database for evaluations serves the INTERSPEECH 2010 Paralinguistic Challenge´s Audiovisual Interest Corpus of human-to-human natural conversation. The results show that even when training is performed on noise corrupted audio which matches the test conditions the addition of visual features is still beneficial.
Keywords :
acoustic noise; acoustic signal processing; audio signal processing; audio-visual systems; pattern classification; recurrent neural nets; support vector machines; video signal processing; INTERSPEECH 2010 Paralinguistic Challenge; LSTM recurrent neural networks; acoustic low-level descriptors; acoustic noise; audio-visual vocal outburst classification; audiovisual interest corpus; babble noise; geometric shape feature fusion; human-to-human natural conversation; long short-term memory recurrent neural networks; noise corrupted audio; noisy acoustic conditions; office noise; street noise; support vector machines; visual features; Feature extraction; Noise measurement; Shape; Signal to noise ratio; Training; Visualization; Audiovisual Processing; Laughter; Long Short-Term Memory; Non-linguistic Vocalisations;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289067