Title :
3D structure estimation from monocular video clips
Author :
Donate, Arturo ; Liu, Xiuwen
Author_Institution :
Dept. of Comput. Sci., Florida State Univ., Tallahassee, FL, USA
Abstract :
This paper explores the idea of extracting three dimensional features from a previously recorded video, in an attempt to provide three dimensional information about a video clip in order to improve the performance of various video analysis tasks. Although video analysis is a very prevalent area of research, the use of 3D features is scarce in the literature due to the inherent difficulties associated with extracting accurate 3D representations of videos in cases where no previous knowledge of the scene or camera is known. In this paper, we present a framework that attempts to compute a dense three dimensional representation of a scene using only the available video sequence. Our proposed system exploits the motion of the camera in order to estimate the relative 3D positions of salient features located in the video frames. Additionally, we incorporate the use of appearance-based models to estimate their relative poses and fit a 3D human model into the reconstructed scenes. We test our method using various video clips obtained from online databases in order to show the feasibility of this approach.
Keywords :
feature extraction; image reconstruction; pose estimation; solid modelling; video cameras; video databases; video signal processing; 3D human model; 3D structure estimation; monocular video clip; online database; salient feature; scene reconstruction; three dimensional feature extracting; video analysis; Cameras; Data mining; Feature extraction; Humans; Information analysis; Layout; Motion estimation; Performance analysis; Testing; Video sequences;
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4244-7029-7
DOI :
10.1109/CVPRW.2010.5543795