Title :
The research for storage scheme based on Hadoop
Author :
Cong Jin;Shuang Ran
Author_Institution :
Media Audio & Video Key Laboratory, Communication University of China, Beijing, China
Abstract :
With the rise of big data, attention has turned to how such data should be stored. In this paper, we first design four storage solutions on the Hadoop platform for a specific data storage task, based on HBase, MySQL, XML, and plain text. Data is stored in the Hadoop Distributed File System (HDFS) in every solution except the MySQL one, where it is stored directly in the Linux file system. Second, we write a dedicated program for each solution. We then run each program separately, recording its execution time and monitoring system performance (CPU usage, memory usage, and CPU load) throughout the execution. By analyzing and comparing the storage solutions, we identify the advantages, disadvantages, and features of each. The results can help in selecting the most appropriate storage solution when processing data on the Hadoop platform.
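The measurement procedure described in the abstract (run a program, record its execution time, and monitor CPU usage, memory usage, and CPU load while it runs) might be sketched as below. This is a minimal illustration using only the Python standard library on a Unix-like system, not the authors' tooling: the function name `run_and_monitor`, the sampling interval, the returned field names, and the placeholder command are all assumptions made for the example.

```python
import os
import resource
import subprocess
import sys
import time


def run_and_monitor(cmd, interval=0.1):
    """Run cmd to completion and collect simple performance figures.

    Hypothetical helper for illustration; Unix-only (uses os.getloadavg
    and the resource module).
    """
    before = resource.getrusage(resource.RUSAGE_CHILDREN)
    start = time.perf_counter()
    proc = subprocess.Popen(cmd)

    load_samples = []
    while proc.poll() is None:
        # 1-minute system load average, sampled while the job is running.
        load_samples.append(os.getloadavg()[0])
        time.sleep(interval)

    elapsed = time.perf_counter() - start
    after = resource.getrusage(resource.RUSAGE_CHILDREN)

    # CPU time (user + system) consumed by the child that just finished.
    cpu_seconds = (after.ru_utime - before.ru_utime) + (
        after.ru_stime - before.ru_stime
    )
    return {
        "elapsed_s": elapsed,
        "cpu_s": cpu_seconds,
        # Average CPU usage as a fraction of one core over the run.
        "cpu_usage": cpu_seconds / elapsed if elapsed > 0 else 0.0,
        # Peak resident set size over all waited-for children so far
        # (kilobytes on Linux), so it is an upper bound for this run.
        "max_rss": after.ru_maxrss,
        "load_samples": load_samples,
    }


if __name__ == "__main__":
    # Placeholder command; a real comparison would launch the program
    # written for each storage solution instead.
    stats = run_and_monitor([sys.executable, "-c", "print('job done')"])
    print(stats)
```

Running the same helper once per storage solution and comparing the returned figures would give the kind of side-by-side view the abstract describes, under the assumption that each solution's program can be started as a single command.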
Keywords :
"XML","Big data","File systems","Linux","Memory","Distributed databases"
Conference_Title :
Computer and Communications (ICCC), 2015 IEEE International Conference on
Print_ISBN :
978-1-4673-8125-3
DOI :
10.1109/CompComm.2015.7387541