DocumentCode :
3745952
Title :
Audio-Visual Classification of Sports Types
Author :
Rikke Gade;Mohamed Abou-Zleikha;Mads Gr?sb?ll ;Thomas B. Moeslund
Author_Institution :
Visual Anal. of People Lab., Aalborg Univ., Aalborg, Denmark
fYear :
2015
Firstpage :
768
Lastpage :
773
Abstract :
In this work we propose a method for classification of sports types from combined audio and visual features extracted from thermal video. From audio Mel Frequency Cepstral Coefficients (MFCC) are extracted, and PCA are applied to reduce the feature space to 10 dimensions. From the visual modality short trajectories are constructed to represent the motion of players. From these, four motion features are extracted and combined directly with audio features for classification. A k-nearest neighbour classifier is applied for classification of 180 1-minute video sequences from three sports types. Using 10-fold cross validation a correct classification rate of 96.11% is obtained with multimodal features, compared to 86.67% and 90.00% using only visual or audio features, respectively.
Keywords :
"Feature extraction","Cameras","Visualization","Mel frequency cepstral coefficient","Trajectory","Target tracking"
Publisher :
ieee
Conference_Titel :
Computer Vision Workshop (ICCVW), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICCVW.2015.104
Filename :
7406453
Link To Document :
بازگشت