DocumentCode :
1829353
Title :
Classification Based Metadata Management for HDFS
Author :
Chandrasekar, Ashok ; Chandrasekar, Karthik ; Ramasatagopan, Harini ; Rafica, A.R. ; Balasubramaniyan, Jagadeesh
Author_Institution :
Dept. of Inf. Technol., Anna Univ., Chennai, India
fYear :
2012
fDate :
25-27 June 2012
Firstpage :
1021
Lastpage :
1026
Abstract :
The way data storage is viewed has been changing consistently. The current trend in data storage is Hadoop, which provides a scalable mechanism for storing extremely large amounts of data and for handling data-intensive scientific applications. It makes use of the MapReduce framework and stores the data in HDFS (Hadoop Distributed File System). In the HDFS architecture, metadata is handled by the NameNode. In this paper, we propose a novel and efficient mechanism for managing the metadata dynamically by classifying it according to its Importance Factor (If), which is a measure of the data's criticality, its frequency of access, and the importance of the client using the data. Metadata management is divided into three different techniques based on this importance. To keep the amount of metadata from becoming a constraint on the main memory of the NameNode, the concept of sequence files is employed. This approach leads to more efficient, low-latency metadata operations and at the same time reduces the bottleneck on the NameNode's main memory.
Keywords :
distributed databases; meta data; parallel programming; pattern classification; public domain software; storage management; HDFS architecture; Hadoop distributed file system; MapReduce; NameNode; data classification; data criticality measure; data intensive scientific application; data storage mechanism; file sequence; importance factor; metadata management; File systems; Java; Memory management; Queueing analysis; Time factors; HDFS; Hadoop; classification; metadata management; namenode;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS)
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
Type :
conf
DOI :
10.1109/HPCC.2012.149
Filename :
6332285