Title of article :
A multimodal temporal panorama approach for moving vehicle detection, reconstruction and classification
Author/Authors :
Wang، نويسنده , , Tao and Zhu، نويسنده , , Zhigang and Taylor، نويسنده , , Clark N.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2013
Pages :
12
From page :
1724
To page :
1735
Abstract :
Moving vehicle detection and classification using multimodal data is a challenging task in data collection, audio-visual alignment, data labeling and feature selection under uncontrolled environments with occlusions, motion blurs, varying image resolutions and perspective distortions. In this work, we propose an effective multimodal temporal panorama approach for moving vehicle detection and classification using a novel long-range audio-visual sensing system. A new audio-visual vehicle (AVV) dataset is created, which features automatic vehicle detection and audio-visual alignment, accurate vehicle extraction and reconstruction, and efficient data labeling. In particular, vehicles’ visual images are reconstructed once detected in order to remove most of the occlusions, motion blurs, and variations of perspective views. Multimodal audio-visual features are extracted, including global geometric features (aspect ratios, profiles), local structure features (HOGs), as well various audio features (MFCCs, etc.). Using radial-based SVMs, the effectiveness of the integration of these multimodal features is thoroughly and systematically studied. The concept of MTP may not be only limited to visual, motion and audio modalities; it could also be applicable to other sensing modalities that can obtain data in the temporal domain.
Keywords :
multimodal , Laser-Doppler vibrometry , Panoramic imaging , Audio-visual integration
Journal title :
Computer Vision and Image Understanding
Serial Year :
2013
Journal title :
Computer Vision and Image Understanding
Record number :
1697080
Link To Document :
بازگشت