• DocumentCode
    244581
  • Title

    Accelerating outlier detection with intra- and inter-node parallelism

  • Author

    Angiulli, Fabrizio ; Basta, Stefano ; Lodi, Stefano ; Sartori, Claudio

  • Author_Institution
    DIMES, UNICAL, Rende, Italy
  • fYear
    2014
  • fDate
    21-25 July 2014
  • Firstpage
    476
  • Lastpage
    483
  • Abstract
    Outlier detection is a data mining task consisting in the discovery of observations which deviate substantially from the rest of the data, and has many important practical applications. Outlier detection in very large data sets is however computationally very demanding and the size limit of the data that can be elaborated is considerably pushed forward by mixing three ingredients: efficient algorithms, intra-cpu parallelism of high-performance architectures, network level parallelism. In this paper we propose an outlier detection algorithm able to exploit the internal parallelism of a GPU and the external parallelism of a cluster of GPU. The algorithm is the evolution of our previous solutions which considered either GPU or network level parallelism. We discuss a set of large scale experiments executed in a supercomputing facility and show the speedup obtained with varying number of nodes.
  • Keywords
    data mining; parallel programming; data mining; high-performance architecture; inter-node parallelism; intra-CPU parallelism; intra-node parallelism; network level parallelism; outlier detection; supercomputing facility; Algorithm design and analysis; Arrays; Data mining; Graphics processing units; Instruction sets; Kernel; Parallel processing; Distance-based outliers; GPU; high performance computing; parallel algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing & Simulation (HPCS), 2014 International Conference on
  • Conference_Location
    Bologna
  • Print_ISBN
    978-1-4799-5312-7
  • Type

    conf

  • DOI
    10.1109/HPCSim.2014.6903723
  • Filename
    6903723