• DocumentCode
    3398857
  • Title

    An Architectural Characterization Study of Data Mining and Bioinformatics Workloads

  • Author

    Ozisikyilmaz, Berkin ; Narayanan, Ramanathan ; Zambreno, Joseph ; Memik, Gokhan ; Choudhary, Alok

  • Author_Institution
    Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL
  • fYear
    2006
  • fDate
    25-27 Oct. 2006
  • Firstpage
    61
  • Lastpage
    70
  • Abstract
    Data mining is the process of automatically finding implicit, previously unknown, and potentially useful information from large volumes of data. Advances in data extraction techniques have resulted in a tremendous increase in the input data size of data mining applications. Data mining systems, on the other hand, have been unable to maintain the same rate of growth. Therefore, there is an increasing need to understand the bottlenecks associated with the execution of these applications on modern architectures. In this paper, we present MineBench, a publicly available benchmark suite containing fifteen representative data mining applications belonging to various categories: classification, clustering, association rule mining and optimization. First, we highlight the uniqueness of data mining applications. Subsequently, we evaluate the MineBench applications on an 8-way shared memory (SMP) machine and analyze important performance characteristics such as L1 and L2 cache miss rates, branch misprediction rates
  • Keywords
    benchmark testing; biology computing; cache storage; data mining; shared memory systems; L1 cache miss rates; L2 cache miss rates; MineBench; architectural characterization; association rule mining; bioinformatics workloads; branch misprediction rates; data extraction; data mining; performance analysis; shared memory machine; Algorithm design and analysis; Application software; Association rules; Bioinformatics; Computer science; Data engineering; Data mining; Multimedia databases; Performance analysis; Streaming media
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Workload Characterization, 2006 IEEE International Symposium on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    1-4244-0508-4
  • Electronic_ISBN
    1-4244-0509-2
  • Type

    conf

  • DOI
    10.1109/IISWC.2006.302730
  • Filename
    4086134