• DocumentCode
    1906499
  • Title

    Data Complexity Measures and Nearest Neighbor Classifiers: A Practical Analysis for Meta-learning

  • Author

    Cavalcanti, G.D.C. ; Ren, T.I. ; Vale, B.A.

  • Author_Institution
    Center for Inf., Fed. Univ. of Pernambuco, Recife, Brazil
  • Volume
    1
  • fYear
    2012
  • fDate
    7-9 Nov. 2012
  • Firstpage
    1065
  • Lastpage
    1069
  • Abstract
    The classifier accuracy is affected by the properties of the data sets used to train it. Nearest neighbor classifiers are known for being simple and accurate in several domains, but their behavior is strongly dependent on data complexity. On the other hand, there are data complexity measures which aim to describe properties of the data sets. This work aims to show how data complexity measures can be efficiently used to predict the behavior of the Nearest Neighbor classifier. Seven data complexity measures and seventeen real datasets are used in the experimental study. Each data complexity measure is analyzed individually in order to find a relationship between its value and the accuracy of the classifier on a given dataset. No single measure used is good enough to predict the behavior of the Nearest Neighbor classifier. However, the combination of these measures provides a powerful tool to predict the accuracy of the Nearest Neighbor classifier.
  • Keywords
    learning (artificial intelligence); pattern classification; classifier accuracy; data complexity measures; meta-learning; nearest neighbor classifiers; Accuracy; Complexity theory; Density measurement; Error analysis; Measurement uncertainty; Pattern recognition; Shape measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
  • Conference_Location
    Athens
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4799-0227-9
  • Type

    conf

  • DOI
    10.1109/ICTAI.2012.150
  • Filename
    6495167