DocumentCode :
3585111
Title :
Mapping of RAID Controller Performance Data to the Job History on Large Computing Systems
Author :
Hartung, Marc ; Kluge, Michael
Author_Institution :
Center for Inf. Services & High Performance Comput. (ZIH), Tech. Univ. Dresden, Dresden, Germany
fYear :
2014
Firstpage :
73
Lastpage :
80
Abstract :
For systems executing a mixture of different data intensive applications in parallel there is always the question about the impact that each application has on the storage subsystem. From the perspective of storage, I/O is typically anonymous as it does not contain user identifiers or similar information. This paper focuses on the analysis of performance data collected on shared system components like global file systems that can not be mapped back to user activities immediately. Our approach classifies user jobs based on their properties into classes and correlates these classes with global timelines. Within the paper we will show details of the clustering algorithm, depict our measurement environment and present first results. The results are valuable for tuning HPC storage system to achieve an optimized behavior on a global system level or to separate users into classes with different I/O demands.
Keywords :
Big Data; data analysis; input-output programs; parallel processing; pattern clustering; Big Data; HPC; I/O subsystem; RAID controller performance data mapping; clustering algorithm; data analysis; high performance computing system; shared system component; Bandwidth; Clustering algorithms; Context; Measurement; Partitioning algorithms; Performance analysis; Radiation detectors; Performance; Stochastic Analysis; Parallel File Systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Intensive Scalable Computing Systems (DISCS), 2014 International Workshop on
Type :
conf
DOI :
10.1109/DISCS.2014.7
Filename :
7079029
Link To Document :
بازگشت