DocumentCode :
720563
Title :
Scalable In-Memory Computing
Author :
Uta, Alexandru ; Sandu, Andreea ; Costache, Stefania ; Kielmann, Thilo
Author_Institution :
Dept. of Comput. Sci., VU Univ. Amsterdam, Amsterdam, Netherlands
fYear :
2015
fDate :
4-7 May 2015
Firstpage :
805
Lastpage :
810
Abstract :
Data-intensive scientific workflows are composed of many tasks that exhibit data precedence constraints leading to communication schemes expressed by means of intermediate files. In such scenarios, the storage layer is often a bottleneck, limiting overall application scalability, due to large volumes of data being generated during runtime at high I/O rates. To alleviate the storage pressure, applications take advantage of in-memory runtime distributed file systems that act as a fast, distributed cache, which greatly enhances I/O performance.In this paper, we present scalability results for MemFS, a distributed in-memory runtime file system. MemFS takes an opposite approach to data locality, by scattering all data among the nodes, leading to well balanced storage and network traffic, and thus making the system both highly per formant and scalable. Our results show that MemFS is platform independent, performing equally well on both private clusters and commercial clouds. On such platforms, running on up to 1024 cores, MemFS shows excellent horizontal scalability (using more nodes), while the vertical scalability (using more cores per node) is only limited by the network b and with. Further more, for this challenge we show how MemFS is able to scale elastically, at runtime, based on the application storage demands. In our experiments, we have successfully used up to 1TB memory when running a large instance of the Montage workflow.
Keywords :
distributed databases; network operating systems; storage management; MemFS; Montage workflow; application storage demands; data locality; distributed in-memory runtime file system; horizontal scalability; in-memory computing; network traffic; vertical scalability; Bandwidth; Databases; Partitioning algorithms; Random access memory; Resource management; Runtime; Scalability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
Conference_Location :
Shenzhen
Type :
conf
DOI :
10.1109/CCGrid.2015.106
Filename :
7152562
Link To Document :
بازگشت