DocumentCode :
3745894
Title :
Deep Spatial Pyramid Ensemble for Cultural Event Recognition
Author :
Xiu-Shen Wei;Bin-Bin Gao;Jianxin Wu
Author_Institution :
Nat. Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
fYear :
2015
Firstpage :
280
Lastpage :
286
Abstract :
Semantic event recognition based only on image cues is a challenging problem in computer vision. To capture rich information and exploit important cues such as human poses, human garments, and scene categories, we propose the Deep Spatial Pyramid Ensemble framework, which builds mainly on our previous work, Deep Spatial Pyramid (DSP). DSP builds universal and powerful image representations from CNN models. Specifically, we employ five deep networks trained on different data sources to extract five corresponding DSP representations for event recognition images. To combine the complementary information from the different DSP representations, we ensemble these features by both "early fusion" and "late fusion". Finally, based on the proposed framework, we present a solution for the Cultural Event Recognition track of the ChaLearn Looking at People (LAP) challenge held in association with ICCV 2015. Our framework achieved one of the best cultural event recognition performances in this challenge.
Keywords :
"Digital signal processing","Cultural differences","Image recognition","Feature extraction","Image representation","Image resolution","Training"
Publisher :
IEEE
Conference_Title :
Computer Vision Workshop (ICCVW), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/ICCVW.2015.45
Filename :
7406394