Title :
Classification Based Metadata Management for HDFS
Author :
Chandrasekar, Ashok ; Chandrasekar, Karthik ; Ramasatagopan, Harini ; Rafica, A.R. ; Balasubramaniyan, Jagadeesh
Author_Institution :
Dept. of Inf. Technol., Anna Univ., Chennai, India
Abstract :
The way data storage is viewed has been changing steadily. The current trend in data storage is Hadoop, which provides a scalable mechanism for storing extremely large amounts of data and for handling data-intensive scientific applications. It uses the MapReduce framework and stores data in HDFS (Hadoop Distributed File System). In the HDFS architecture, metadata is handled by the NameNode. In this paper, we propose a novel and efficient mechanism for managing metadata dynamically by classifying it according to its Importance Factor (If), a measure of the data's criticality, its frequency of access, and the importance of the client using the data. Metadata management is divided into three different techniques based on this importance. To keep the volume of metadata from becoming a constraint on the main memory of the NameNode, sequence files are employed. This approach leads to more efficient, low-latency metadata operations while reducing the bottleneck on the NameNode's main memory.
Keywords :
distributed databases; meta data; parallel programming; pattern classification; public domain software; storage management; HDFS architecture; Hadoop distributed file system; MapReduce; NameNode; data classification; data criticality measure; data intensive scientific application; data storage mechanism; file sequence; importance factor; metadata management; File systems; Java; Memory management; Queueing analysis; Time factors; HDFS; Hadoop; classification; metadata management; namenode;
Conference_Title :
High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
DOI :
10.1109/HPCC.2012.149