Title :
Parallel visualization on large clusters using MapReduce
Author :
Vo, H.T. ; Bronson, J. ; Summa, B. ; Comba, J.L.D. ; Freire, J. ; Howe, B. ; Pascucci, V. ; Silva, C.T.
Author_Institution :
Polytech. Inst., New York Univ., New York, NY, USA
Abstract :
Large-scale visualization systems are typically designed to efficiently “push” datasets through the graphics hardware. However, exploratory visualization systems are increasingly expected to support scalable data manipulation, restructuring, and querying capabilities in addition to core visualization algorithms. We posit that new emerging abstractions for parallel data processing, in particular computing clouds, can be leveraged to support large-scale data exploration through visualization. In this paper, we take a first step in evaluating the suitability of the MapReduce framework to implement large-scale visualization techniques. MapReduce is a lightweight, scalable, general-purpose parallel data processing framework increasingly popular in the context of cloud computing. Specifically, we implement and evaluate a representative suite of visualization tasks (mesh rendering, isosurface extraction, and mesh simplification) as MapReduce programs, and report quantitative performance results applying these algorithms to realistic datasets. For example, we perform isosurface extraction of up to l6 isovalues for volumes composed of 27 billion voxels, simplification of meshes with 30GBs of data and subsequent rendering with image resolutions up to 800002 pixels. Our results indicate that the parallel scalability, ease of use, ease of access to computing resources, and fault-tolerance of MapReduce offer a promising foundation for a combined data manipulation and data visualization system deployed in a public cloud or a local commodity cluster.
Keywords :
cloud computing; data visualisation; fault tolerance; image resolution; mesh generation; parallel processing; pattern clustering; query processing; rendering (computer graphics); MapReduce programs; cloud computing; combined data manipulation; computing clouds; computing resources; core visualization algorithms; data visualization system; exploratory visualization systems; fault-tolerance; graphics hardware; image resolutions; isosurface extraction; large clusters; large-scale data exploration; large-scale visualization systems; large-scale visualization techniques; local commodity cluster; mesh rendering; mesh simplification; parallel data processing framework; parallel scalability; parallel visualization; public cloud cluster; quantitative performance; querying capability; realistic datasets; restructuring capability; scalable data manipulation; subsequent rendering; visualization tasks; Runtime; Visualization; Hadoop; MapReduce; cloud computing; gigapixels; large meshes; volume rendering;
Conference_Titel :
Large Data Analysis and Visualization (LDAV), 2011 IEEE Symposium on
Conference_Location :
Providence, Rl
Print_ISBN :
978-1-4673-0156-5
DOI :
10.1109/LDAV.2011.6092321