DocumentCode :
3734004
Title :
The research for storage scheme based on Hadoop
Author :
Cong Jin;Shuang Ran
Author_Institution :
Media Audio &Video Key Laboratory, Communication University of China, Beijing, China
fYear :
2015
Firstpage :
62
Lastpage :
66
Abstract :
With the rise of big data, people begin to focus on storage of it. In this paper, first, we design four different storage solutions on Hadoop platform with HBase, MySQL, XML and plain text for specific data storage. Data is stored in Hadoop Distributed File System except for MySQL storage solution, in which data is stored directly in the Linux File System. Second, for each solution, we write specific program to meet the needs. Then we run each program respectively, record execution time and monitor system performance - CPU usage, memory usage and CPU load during the whole execution. Through analyzing and comparing the differences among storage solutions, we finally conclude the advantages, disadvantages and features of each one. The results can help us to select the most appropriate storage solution with these characteristics when processing data with Hadoop platform.
Keywords :
"XML","Big data","File systems","Linux","Memory","Distributed databases"
Publisher :
ieee
Conference_Titel :
Computer and Communications (ICCC), 2015 IEEE International Conference on
Print_ISBN :
978-1-4673-8125-3
Type :
conf
DOI :
10.1109/CompComm.2015.7387541
Filename :
7387541
Link To Document :
بازگشت