• DocumentCode
    560142
  • Title

    A similarity measure for time, frequency, and dependencies in large-scale workloads

  • Author

    Lassnig, Mario ; Fahringer, Thomas ; Garonne, Vincent ; Molfetas, Angelos ; Barisits, Martin

  • Author_Institution
    Inst. of Comput. Sci., Univ. of Innsbruck, Innsbruck, Austria
  • fYear
    2011
  • fDate
    12-18 Nov. 2011
  • Firstpage
    1
  • Lastpage
    11
  • Abstract
    Performance evaluations of large-scale systems require the use of representative workloads with certifiable similar or dissimilar characteristics. To quantify the similarity of the characteristics, we describe a novel measure comprising two efficient methods that are suitable for large-scale workloads. One method uses the discrete wavelet transform to assess the periodic time and frequency characteristics in the workload. The second method evaluates dependencies in descriptive attributes via association rule learning. Both methods are evaluated to find the limits of their similarity spaces. Additionally, the wavelet method is evaluated against existing similarity methods and tested for noise robustness and random bias. An empirical study using workloads from seven operational large-scale systems evaluates the measure´s accuracy. The results show that our measure is highly resistant to noise, well-suited for large-scale workloads, covers 87% of the possible similarity space, and improves accuracy by 24.5% and standard deviation by 10.8% when compared to existing work.
  • Keywords
    data mining; discrete wavelet transforms; learning (artificial intelligence); performance evaluation; association rule learning; dependencies evaluation; descriptive attributes; discrete wavelet transform; frequency characteristics; large-scale systems; large-scale workloads; noise robustness; performance evaluations; periodic time; random bias; similarity measure; Discrete wavelet transforms; Generators; Large-scale systems; Noise; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis (SC), 2011 International Conference for
  • Conference_Location
    Seatle, WA
  • Electronic_ISBN
    978-1-4503-0771-0
  • Type

    conf

  • Filename
    6114404