DocumentCode :
3717243
Title :
Early experience with optimizing I/O performance using high-performance SSDs for in-memory cluster computing
Author :
I. Stephen Choi;Weiqing Yang;Yang-Suk Kee
Author_Institution :
Memory Solutions Lab., Samsung Semiconductor Inc., San Jose, CA
fYear :
2015
Firstpage :
1073
Lastpage :
1083
Abstract :
This paper describes our experience with storage optimization that uses cost-effective PCIe solid-state drives (SSDs) to improve the overall performance of the Spark framework. The key problem we address is limited memory-system performance. In particular, we adopt high-performance SSDs to relieve saturated DRAM bandwidth and limited DRAM capacity. We use SSDs to store shuffle data and persisted RDDs. As a result, overall performance improves because the SSDs provide larger capacity and additional bandwidth while alleviating memory contention. Our experiments show that this approach improves the performance of data-intensive applications by 23.1% on average compared with the memory-only approach. To our knowledge, this is the first work to demonstrate performance optimizations using PCIe SSDs on Spark.
Keywords :
"Sparks","Bandwidth","Random access memory","Memory management","Servers","Big data","Programming"
Publisher :
IEEE
Conference_Title :
Big Data (Big Data), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/BigData.2015.7363861
Filename :
7363861