Title :
Research on Hadoop Cloud Computing Model and its Applications
Author :
Lu, Huang ; Hai-Shan, Chen ; Ting-Ting, Hu
Abstract :
Hadoop is an open-source software platform for distributed computing dealing with a parallel processing of large data sets. It has been widely used in the field of cloud computing. This paper describes the three most crucial parts of Hadoop, including HDFS, the distributed file system, MapReduce, the data processing model, and HBase, the distributed structured data table. The application status, main research directions and existing problems of Hadoop data processing platform are analyzed, and some performance optimization suggestions are given.
Keywords :
Cloud computing; Data processing; Distributed databases; Educational institutions; Optimization; Servers; HDFS; Hadoop; MapReduce; Performance Optimization;
Conference_Titel :
Networking and Distributed Computing (ICNDC), 2012 Third International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4673-2858-6
DOI :
10.1109/ICNDC.2012.22