Title :
A Case for Scaling HPC Metadata Performance through De-specialization
Author :
Patil, Swapnil ; Kai Ren ; Gibson, Garth
Abstract :
Lack of a highly scalable and parallel metadata service is the Achilles heel for many cluster file system deployments in both the HPC world and the Internet services world. This is because most cluster file systems have focused on scaling the data path, i.e. providing high bandwidth parallel I/O to files that are gigabytes in size. But with proliferation of massively parallel applications that produce metadata-intensive workloads, such as large number of simultaneous file creates and large-scale storage management, cluster file systems also need to scale metadata performance. To realize these goals, this paper makes a case for a scalable metadata service middleware that layers on existing cluster file system deployments and distributes file system metadata, including the namespace tree, small directories and large directories, across many servers. Our key idea is to effectively synthesize a concurrent indexing technique to distribute metadata with a tabular, on-disk representation of all file system metadata.
Keywords :
database indexing; input-output programs; meta data; middleware; parallel databases; software performance evaluation; storage management; Achilles heel; HPC metadata performance scaling; HPC world; Internet services world; cluster file system deployments; concurrent indexing technique; data path scaling; despecialization; file system metadata distribution; large directories; large-scale storage management; metadata-intensive workloads; namespace tree; on-disk representation; parallel metadata service; scalable metadata service middleware; small directories; file system; metadata; performance;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-6218-4
DOI :
10.1109/SC.Companion.2012.372