DocumentCode
7558
Title
IO Workload Characterization Revisited: A Data-Mining Approach
Author
Bumjoon Seo ; Sooyong Kang ; Jongmoo Choi ; Jaehyuk Cha ; Youjip Won ; Sungroh Yoon
Author_Institution
Dept. of Electr. & Comput. Eng., Seoul Nat. Univ., Seoul, South Korea
Volume
63
Issue
12
fYear
2014
fDate
Dec. 2014
Firstpage
3026
Lastpage
3038
Abstract
Over the past few decades, IO workload characterization has been a critical issue for operating system and storage community. Even so, the issue still deserves investigation because of the continued introduction of novel storage devices such as solid-state drives (SSDs), which have different characteristics from traditional hard disks. We propose novel IO workload characterization and classification schemes, aiming at addressing three major issues: (i) deciding right mining algorithms for IO traffic analysis, (ii) determining a feature set to properly characterize IO workloads, and (iii) defining essential IO traffic classes state-of-the-art storage devices can exploit in their internal management. The proposed characterization scheme extracts basic attributes that can effectively represent the characteristics of IO workloads and, based on the attributes, finds representative access patterns in general workloads using various clustering algorithms. The proposed classification scheme finds a small number of representative patterns of a given workload that can be exploited for optimization either in the storage stack of the operating system or inside the storage device.
Keywords
data mining; pattern clustering; storage management; IO traffic analysis; IO workload characterization; classification schemes; clustering algorithms; data-mining; feature set determination; internal management; representative access patterns; storage devices; Algorithm design and analysis; Clustering algorithms; Data mining; Feature extraction; Operating systems; Servers; IO workload characterization; SSD; classification; clustering; storage and operating systems;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2013.187
Filename
6598677
Link To Document