DocumentCode
179729
Title
Unsupervised feature extraction for multimedia event detection and ranking using audio content
Author
Amid, Ehsan ; Mesaros, Annamaria ; Palomaki, Kalle J. ; Laaksonen, Jorma ; Kurimo, Mikko
Author_Institution
Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland
fYear
2014
fDate
4-9 May 2014
Firstpage
5939
Lastpage
5943
Abstract
In this paper, we propose a new approach to classify and rank multimedia events based purely on audio content using video data from TRECVID-2013 multimedia event detection (MED) challenge. We perform several layers of nonlinear mappings to extract a set of unsupervised features from an initial set of temporal and spectral features to obtain a superior presentation of the atomic audio units. Additionally, we propose a novel weighted divergence measure for kernel based classifiers. The extensive set of experiments confirms that augmentation of the proposed steps results in an improved accuracy for most of the event classes.
Keywords
audio signal processing; audio streaming; feature extraction; multimedia systems; signal classification; spectral analysis; unsupervised learning; TRECVID-2013; atomic audio units; audio content; kernel based classifier; multimedia event classification; multimedia event detection; multimedia event ranking; nonlinear mapping; spectral features; temporal features; unsupervised feature extraction; video data; weighted divergence measure; Event detection; Feature extraction; Histograms; Multimedia communication; Noise reduction; Speech; Vectors; Bag of Words; Multimedia Event Detection; Stacked Denoising Autoencoders; Term Weighting; Unsupervised Feature Extraction; Weighted Jensen-Shannon Divergence;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854743
Filename
6854743
Link To Document