• DocumentCode
    2010754
  • Title

    SNN Input Parameters: How Are They Related?

  • Author

    Moreira, Guilherme ; Santos, Maribel Y. ; Moura-Pires, Joao

  • Author_Institution
    Algoritmi Res. Centre, Univ. of Minho, Guimaraes, Portugal
  • fYear
    2013
  • fDate
    15-18 Dec. 2013
  • Firstpage
    492
  • Lastpage
    497
  • Abstract
    Nowadays, organizations are facing several challenges when they try to analyze generated data with the aim of extracting useful information. This analytical capacity needs to be enhanced with tools capable of dealing with big data sets without making the analytical process a difficult task. Clustering is usually used, as this technique does not require any prior knowledge about the data. However, clustering algorithms usually require one or more input parameters that influence the clustering process and the results that can be obtained. This work analyses the relation between the three input parameters of the SNN (Shared Nearest Neighbor) algorithm and proposes specific guidelines for the identification of the appropriate input parameters that optimizes the processing time.
  • Keywords
    Big Data; data analysis; pattern clustering; SNN input parameters; big data sets; clustering algorithms; data analysis; information extraction; shared nearest neighbor algorithm; Algorithm design and analysis; Clustering algorithms; Correlation; Data mining; Graphical models; Guidelines; Noise; clustering; density-based clustering; input parameters tuning; shared nearest neighbor;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Systems (ICPADS), 2013 International Conference on
  • Conference_Location
    Seoul
  • ISSN
    1521-9097
  • Type

    conf

  • DOI
    10.1109/ICPADS.2013.89
  • Filename
    6808226