DocumentCode :
1335213
Title :
High Performance Stereo Vision Designed for Massively Data Parallel Platforms
Author :
Yu, Wei ; Chen, Tsuhan ; Franchetti, Franz ; Hoe, James C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
20
Issue :
11
fYear :
2010
Firstpage :
1509
Lastpage :
1519
Abstract :
Real-time stereo vision is attractive in many applications like robot navigation and 3-D scene reconstruction. Data parallel platforms, e.g., graphics processing unit (GPU), are often used for real-time stereo, because most stereo algorithms involve a large portion of data parallel computations. In this paper, we propose a stereo system on GPU which pushes the Pareto-efficiency frontline in the accuracy and speed tradeoff space. Our system is based on a hardware-aware algorithm design approach. The system consists of new algorithms and code optimization techniques. We emphasize on keeping the highly data parallel structure in the algorithm design process such that the algorithms can be effectively mapped to massively data parallel platforms. We propose two stereo algorithms: namely, exponential step size adaptive weight (ESAW), and exponential step size message propagation (ESMP). ESAW reduces computational complexity without sacrificing disparity accuracy. ESMP is an extension of ESAW, which incorporates the smoothness term to better model non-frontal planes. ESMP offers additional choice in the accuracy and speed tradeoff space. We adopt code optimization methodologies from the performance tuning community, and apply them to this specific application. Such an approach gives higher performance than optimizing the code in an “ad hoc” manner, and helps understanding the code efficiency. Experiment results demonstrate a speedup factor of 2.7-8.5 over state-of-the-art stereo systems at comparable disparity accuracy.
Keywords :
coprocessors; image reconstruction; parallel processing; stereo image processing; 3D scene reconstruction; GPU; Pareto-efficiency frontline; code optimization; computational complexity; exponential step size adaptive weight; exponential step size message propagation; hardware-aware algorithm design; highly data parallel structure; massively data parallel platforms; nonfrontal planes; real-time stereo vision; robot navigation; stereo system; Accuracy; Algorithm design and analysis; Error analysis; Graphics processing unit; Optimization; Pixel; Real time systems; Code optimization; data parallel; graphics processing unit (GPU); multicore; real-time; stereo;
fLanguage :
English
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
Publisher :
ieee
ISSN :
1051-8215
Type :
jour
DOI :
10.1109/TCSVT.2010.2077771
Filename :
5585736
Link To Document :
بازگشت