Unsupervised feature extraction for multimedia event detection and ranking using audio content

Author

Amid, Ehsan ; Mesaros, Annamaria ; Palomaki, Kalle J. ; Laaksonen, Jorma ; Kurimo, Mikko

Author_Institution

Dept. of Signal Process. & Acoust., Aalto Univ., Espoo, Finland

fYear

2014

fDate

4-9 May 2014

Firstpage

5939

Lastpage

5943

Abstract

In this paper, we propose a new approach to classify and rank multimedia events based purely on audio content using video data from TRECVID-2013 multimedia event detection (MED) challenge. We perform several layers of nonlinear mappings to extract a set of unsupervised features from an initial set of temporal and spectral features to obtain a superior presentation of the atomic audio units. Additionally, we propose a novel weighted divergence measure for kernel based classifiers. The extensive set of experiments confirms that augmentation of the proposed steps results in an improved accuracy for most of the event classes.

Keywords

audio signal processing; audio streaming; feature extraction; multimedia systems; signal classification; spectral analysis; unsupervised learning; TRECVID-2013; atomic audio units; audio content; kernel based classifier; multimedia event classification; multimedia event detection; multimedia event ranking; nonlinear mapping; spectral features; temporal features; unsupervised feature extraction; video data; weighted divergence measure; Event detection; Feature extraction; Histograms; Multimedia communication; Noise reduction; Speech; Vectors; Bag of Words; Multimedia Event Detection; Stacked Denoising Autoencoders; Term Weighting; Unsupervised Feature Extraction; Weighted Jensen-Shannon Divergence;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location

Florence

Type

conf

DOI

10.1109/ICASSP.2014.6854743

Filename

6854743