Title :
Hierarchical Bloom filter arrays (HBA): a novel, scalable metadata management system for large cluster-based storage
Author :
Zhu, Yifeng ; Jiang, Hong ; Wang, Jun
Author_Institution :
Dept. of Comput. Sci. & Eng., Nebraska Univ., Lincoln, NE, USA
Abstract :
An efficient and distributed scheme for file mapping or file lookup scheme is critical in decentralizing metadata management within a group of metadata servers. This work presents a technique called HBA (hierarchical Bloom filter arrays) to map file names to the servers holding their metadata. Two levels of probabilistic arrays, i.e., Bloom filter arrays, with different accuracies are used on each metadata server. One array, with lower accuracy and representing the distribution of the entire metadata, trades accuracy for significantly reduced memory overhead, while the other array, with higher accuracy, caches partial distribution information and exploits the temporal locality of file access patterns. Extensive trace-driven simulations have shown our HBA design to be highly effective and efficient in improving performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or superclusters).
Keywords :
arrays; digital storage; distributed processing; file organisation; memory architecture; meta data; workstation clusters; file access patterns; file lookup; file mapping; hierarchical Bloom filter arrays; large cluster-based storage; metadata management; metadata server; partial distribution information caching; probabilistic arrays; superclusters; temporal locality; trace-driven simulations; Bandwidth; Computer networks; Computer science; Engineering management; File servers; High performance computing; Image storage; Information filtering; Information filters; Scalability;
Conference_Titel :
Cluster Computing, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8694-9
DOI :
10.1109/CLUSTR.2004.1392614